SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: Choi et . al . 

(ii) TITLE OF INVENTION: Streptococcus pneumoniae Antigens and 
Vaccines 

(iii) NUMBER OF SEQUENCES: 452 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences , Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockyille 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) ZIP: 20850 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

(B) COMPUTER: HP Vectra 4 86/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 

(D) SOFTWARE: ASCII Text 



(-vi) CURRENT APPLICATION DATA: 

(A) * APPLICATION NUMBER : Unassigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER : 08/961, 083 

(B) FILING DATE:OCT-30-1997 



(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Michelle S. Marks 

(B) REGISTRATION NUMBER: 41,971 

(C) REFERENCE /DOCKET NUMBER: PB34 0P2C1 



( vi ) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (301) 309-8504 
•(B) TELEFAX: ( 30 1) 309-85 12 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS-: ... 

(A) LENGTH: 1999 base pairs 
, - (B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) - SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TAAAATCTAC GACAATAAAA ATCAACTCAT TGCTGACTTG GGTTCTGAAC GCCGCGTCAA .60 

TGCCCAAGCT AATGATATTC CCACAGATTT GGTTAAGGCA ATCGTTTCTA TCGAAGAC C A 12 0 

TCGCTTCTTC GACCACAGGG GGATTGATAC CATCCGTATC CTGGGAGCTT TCTTGCGCAA 180 

TCTGCAAAGC AATTCCCTCC AAGGTGGATC AACTCTCACC CAACAGTTGA TTAAGTTGAC 240 

TTACTTTTCA ACTTCGACTT CCGACCAGAC TATTTCTCGT AAGGCTCAGG AAGCTTGGTT 3 00 . 

AGCGATTCAG TTAGAACAAA AAGCAACCAA GCAAGAAATC TTGAC CTACT ATATAAATAA 3 60 

GGTCTACATG TCTAATGGGA ACTATGGAAT GCAGACAGCA GCTCAAAACT ACTATGGTAA 42 0 

AGACCTCAAT AATTTAAGTT TACCTCAGTT AGCCTTGCTG GCTGGAATGC CTCAGGCACC 480 

AAACCAATAT GACCCCTATT CACATCCAGA AGCAGCCCAA GACCGCCGAA ACTTGGTCTT .540 

ATCTGAAATG AAAAATCAAG GCTACATCTC TGCTGAACAG TATGAGAAAG CAGTCAATAC 600 

ACCAATTACT GATGGACTAC AAAGTCTCAA ATCAGCAAGT AATTACCCTG CTTACATGGA 660 

TAATTACCTC AAGGAAGTCA TCAATCAAGT TGAAGAAGAA ACAGGCTATA ACCTACTCAC . 72 0 

AACTGGGATG GATGTCTACA CAAATGTAGA CCAAGAAGCT CAAAAACATC TGTGGGATAT 78 0 

TTACAATACA GACGAATACG TTGCCTATCC AGACGATGAA TTGCAAGTCG CTTCTACCAT 840 

TGTTGATGTT TCTAACGGTA AAGTCATTGC CCAGCTAGGA GCACGCCATC AGTCAAGTAA 900 

TGTTTCCTTC GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CAACTATGAA 960 

. ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA CTGCTACTAT 102 0 

CGTTC AC GAT GAGCCCTATA ACTACCCTGG GACAAATACT CCTGTTTATA ACTGGGATAG 1080 

GGGCTACTTT GGCAACATCA CCTTGCAATA CGCCCTGCAA CAATCGCGAA ACGTCCCAGC 1140 

CGTGGAAACT CTAAACAAGG TCGGACTCAA CCGCGCCAAG ACTTTCCTAA ATGGTCTAGG 1200 

■AATCGACTAC CCAAGTATTC ACTACTCAAA . TGCCATTTCA AGTAACACAA CCGAATCAGA 12 60 

CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGC TGCT TACGCTGCCT TTGCAAATGG 1320 

TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC TTTAGTGATG GGAGTGAAAA .13 8 0 

AGAGTTCTCT AATGTC GGAA CTCGTGCCAT GAAGGAAACG ACAGCCTATA TGATGACCGA 1440 

CATGATGAAA ACAGTCTTGA CTTATGGAAC TGGACGAAAT GCCTATCTTG CTTGGCTCCC 1500 

TCAGGCTGGT AAAACAGGAA CCTCTAACTA TACAGACGAG GAAATTGAAA ACCACATCAA " 1560 
GACCTCTCAA • TTTGTAGCAC CTGATGAACT ATTTGC TGGC TATACGCGTA AATATTCAAT . 1620 

GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT— GTAGGCAATG GCCTTACGGT 1680 

CGCTGCCAAA GTTTACCGCT CTATGATGAC CTACCTGTCT GAAGGAAGCA ATCCAGAAGA 1740 
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TTGGAATATA CCAGAGGGGC * TCTACAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG ' 1800 

TTCTACGTGG AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1860 

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA. AGCACAAATA ATAGTACGAC 1920 

TAC C AATCCT AACAATAATA CGCAACAATC AAATACAACC CGTGATCAAC AAAATCAGAA . 198 0 

TCCTCAACCA GCACAACCA 1999 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 666 amino acids ----- 

(B) TYPE: amino acid 

(C) STRANDEDNESS-: single 
, (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Lys lie Tyr Asp Asn Lys Asn Gin Leu lie Ala Asp Leu Gly Ser Glu 

1 5 10 15 

Arg Arg Val Asn Ala Gin Ala Asn Asp lie Pro Thr Asp Leu Val Lys - 
20 25 30 

Ala He Val Ser He Glu Asp His Arg Phe Phe Asp His Arg Gly He 
35 40 45 

Asp Thr He Arg He Leu Gly Ala Phe Leu Arg Asn Leu Gin Ser Asn 
50 55 60 

Ser Leu Gin Gly Gly Ser Thr. Leu Thr Gin Gin Leu He Lys Leu Thr 
65 . 70 75 80 

Tyr Phe Ser Thr Ser Thr Ser Asp Gin Thr He Ser Arg Lys Ala Gin 
85 90 95 

Glu Ala, Trp Leu Ala lie Gin Leu Glu Gin Lys Ala Thr Lys Gin Glu 
100 105 110 

lie Leu Thr Tyr Tyr lie' Asn Lys Val Tyr Met Ser Asn Gly Asn Tyr 
- " 115 : 120 125 

Gly Met Gin Thr Ala Ala Gin Asn Tyr Tyr Gly Lys Asp Leu Asn Asn 
130 135 140 ' 

Leu Ser. Leu Pro Gin Leu Ala Leu Leu Ala Gly' Met Pro Gin Ala Pro 

145 ^ - 150 ' 155 / 160 

Asn Gin Tyr Asp Pro Tyr Ser His Pro Glu Ala Ala Gin Asp Arg Arg 

_ 165 170 - . 17 5 

Asn Leu Val Leu Ser Glu Met Lys Asn Gin Gly Tyr He Ser Ala Glu 

180 185 190 

Gin Tyr- Glu Lys Ala Val Asn.- Thr Pro He Thr Asp -Gly— Leu-Gln_Ser 

. - - 1-95 - 200 . - • ' . \ 205 . - 

__Leu Lys Ser Ala Ser Asn" Tyr Pro Ala Tyr Met Asp Asn Tyr . Leu Lys 
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210 215 220 

Glu Val lie Asn Gin Val Glu Glu Glu Thr Gly Tyr Asn Leu. Leu Thr 
225 - 230 - -235 c 240 

Thr Gly Met Asp Val Tyr Thr Asn Val Asp Gin Glu Ala Gin Lys His 

245 .250 , 255 

Leu Trp Asp lie Tyr Asn Thr Asp Glu Tyr Val Ala Tyr Pro Asp Asp 
260 265 270 

Glu Leu Gin Val Ala Ser Thr lie Val Asp Val Ser Asn Gly Lys Val 
275 280 285 

lie Ala Gin Leu Gly Ala Arg His Gin Ser Ser Asn Val Ser Phe Gly 
290 295 300 

lie Asn Gin Ala Val Glu Thr- Asn Arg Asp Trp Gly Ser Thr Met Lys 
305 310 315 320 

Pro lie Thr Asp Tyr Ala 7 Pro Ala Leu~GIu"Tyr Gly Val Tyr Asp Ser 
325 330 335 

Thr Ala Thr lie Val His Asp Glu Pro Tyr Asn Tyr Pro Gly Thr. Asn 
340 345 350 

Thr Pro Val Tyr Asn Trp Asp Arg Gly Tyr Phe Gly Asn lie Thr Leu 
355 360 365 

Gin Tyr Ala Leu Gin Gin Ser Arg Asn Val Pro Ala Val Glu Thr Leu 
370 375 380 

Asn Lys Val Gly Leu Asn Arg Ala Lys Thr Phe Leu Asn Gly Leu Gly 
385 390 . 395 400 

lie Asp Tyr Pro Ser He His Tyr Ser Asn Ala He Ser Ser Asn Thr 
405 410 ' 415 

Thr Glu Ser Asp Lys Lys Tyr Gly Ala Ser Ser Glu. Lys Met Ala Ala 
420 425 43 0 

Ala Tyr Ala Ala Phe Ala Asn Gly Gly Thr Tyr Tyr Lys Pro Met Tyr 
435 440 445 

He His Lys Val Val Phe Ser Asp Gly Ser Glu Lys 'Glu Phe Ser Asn 
450 455 460 

Val Gly Thr Arg. Ala Met Lys Glu Thr. Thr Ala Tyr Met Met Thr Asp 
465 '- 470 475 480 

Met Met Lys Thr Val Leu Thr Tyr Gly Thr Gly Arg Asn Ala Tyr Leu 
485. 490 495 

Ala Trp Leu Pro Gin Ala Gly Lys Thr - Gly Thr Ser Asn Tyr Thr Asp 
500 505 • 510 

Glu Glu He Glu Asn His He Lys Thr Ser Gin Phe Val Ala Pro Asp 
515 ' 520 , 525 



Glu Leu Phe Ala Gly Tyr ..Thr Arg Lys_Tyr Ser.Met Ala Val Trp Thr 
530 535—— _ 5 40-- 



Gly Tyr Ser Asn Arg Leu Thr Pro Leu Val Gly Asn Gly Leu Thr Val 
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545 550 555 560 

Ala Ala Lys Val Tyr Arg Ser Met Met Thr Tyr Leu Ser Glu Gly'Ser 
565 - 570 - -57-5— 

Asn Pro- Glu Asp Trp Asn lie Pro Glu Gly Leu Tyr Arg Asn Gly Glu 
580 585_ ^ 590 

Phe Val Phe Lys Asn Gly Ala Arg Ser Thr Trp Asn Ser Pro Ala Pro 
595 600 605 . 

Gin Gin Pro- Pro Ser Thr Glu Ser Ser Ser Ser Ser Ser Asp Ser Ser 
610 615 * 620 

Thr Ser Gin Ser Ser Ser Thr Thr Pro Ser Thr Asn Asn Ser Thr Thr 
625 630 635 640 

Thr Asn Pro Asn Asn Asn Thr Gin Gin Ser Asn Thr Thr Pro Asp Gin 
- 645 ~ 650 655 

Gin Asn Gin Asn Pro Gin Pro Ala Gin p ro -™— ^ - 
660 665 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1714 base pairs 

(B) TYPE: nucleic acid 

- (C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

' AAATTACAAT ACGGACTATG AATTGACCTC TGGAGAAAAA TTACCTCTTC CTAAAGAGAT ' 60 
TTCAGGTTAC ACTTATATTG GATATATCAA AGAGGGAAAA ACGACTTCTG AGTCTGAAGT '12 0 

AAGTAATCAA AAGAGTTCAG TTGCCACTCC TACAAAACAA CAAAAGGTGG ATTATAATGT 180 

TACACCGAAT TTTGTAGACC ATCCATCAAC AGTACAAGCT ATTCAGGAAC AAACACCTGT . 240 

TTCTTCAACT AAGCCGACAG AAGTTCAAGT AGTTGAAAAA CCTTTCTCTA CTGAATTAAT 300 

CAATCCAAGA AAAGAAGAGA AACAATCTTC AGATTCTCAA GAACAATTAG CCGAACATAA 3 60 

GAATCTAGAA ACGAAGAAAG AGGAGAAGAT TTCTCCAAAA GAAAAGACTG GGGTAAATAC . 42 0 

ATTAAATCCA CAGGATGAAG TTTTATCAGG TCAATTGAAC AAAC CTGAAC TCTTATATCG ,48 0 

TGAGGAAACT ATGGAGACAA AAATAGATTT TCAAGAAGAA" ATTCAAGAAA ATCCTGATTT 54 0 

AGCTGAAGGA ACTGTAAGAG TAAAACAAGA AGGTAAATTA GGTAAGAAAG TTGAAATCGT 600 

CAGAATATTC TCTGTAAACA AGGAAGAAGT TTCGCGAGAA ATTGTTTCAA CTTCAACGAC 660 

TGCGCCTAGT CCAAGAATAG TCGAAAAAGG TACTAAAAAA ACTCAAGTTA TAAAGGAACA 72 0 

ACCTGAGACT GGTGTAGAAC ATAAGGACGT ACAGTCTGGA GCTATTGTTG AACCCGCAAT 78 0 

TCAGCCTGAG TTGCCCGAAG CTGTAGTAAG TGACAAAGGC GAACCAGAAG TTCAACCTAC 84 0 
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ATT AC C C G AA GCAGTTGTGA CCGACAAAGG TGAGACTGAG GTTCAACCAG AGTCGCCAGA 900 

TACTGTGGTA AGTGATAAAG GTGAACCAGA GCAGGTAGCA CCGCTTCCAG AATAT AAGGG 960 

TAATATTGAG CAAGTAAAAC CTGAAACTCC GGTTGAGAAG ACCAAAGAAC AAGGTCCAGA -1020 

AAAAACTGAA GAAGTTCCAG TAAAACCAAC AGAAGAAACA CCAGTAAATC CAAATGAAGG 108 0 

TACTACAGAA GGAACCTCAA TTCAAGAAGC AGAAAATCCA GTTCAACCTG CAGAAGAATC ■ 1140 

AACAACGAAT TCAGAGAAAG TATCACCAGA TACATCTAGC AAAAATACTG GGGAAGTGTC 12 00 

CAGTAATCCT AGTGATTCGA CAACCTCAGT TGGAGAATCA AATAAAC C AG AACATAATGA 12 60 

CTCTAAAAAT GAAAATTCAG AAAAAACTGT AGAAGAAGTT CCAGTAAATC CAAATGAAGG 13 20 

CACAGTAGAA GGTACCTCAA ATCAAGAAAC AGAAAAACCA GTTCAACCTG CAGAAGAAAC 13 80 

ACAAACAAAC TCTGGGAAAA TAGCTAAC QA AAATAC TGG A GAAGTATC C A ATAAACCTAG 1440 

TGATTCAAAA CCACCAGTTG AAGAATCAAA TCAACCAGAA AAAAACGGAA CTGCAACAAA 1500" 

ACCAGAAAAT TCAGGTAATA CAACATCAGA GAATGGACAA ACAGAACCAG AACCATCAAA 1560 

CGGAAATTCA AC TGAGGATG TTTCAACCGA ATCAAACACA TCCAATTCAA ATGGAAACGA 162 0 

AGAAATTAAA CAAGAAAATG AACTAGACCC TGATAAAAAG GTAGAAGAAC CAGAGAAAAC 168 0 

ACTTGAATTA AGAAATGTTT CCGACCTAGA GTTA 1714 

{2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: . 

. (A) LENGTH 571 amino acids . 

(B) TYPE: amino acid 

(C) - STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein " 

(xij SEQUENCE /DESCRIPTION: SEQ ID NO : 4 : 

Asn Tyr Asn Thr Asp Tyr Glu Leu Thr Ser Gly Glu Lys Leu Pro Leu 
1 5 10 15 

Pro Lys Glu lie Ser Gly Tyr Thr Tyr lie Gly Tyr lie Lys Glu Gly 
20 25 30 

Lys Thr Thr Ser Glu Ser Glu Val Ser Asn Gin Lys Ser Ser Val Ala ' 
35 40 ' 45 

Thr Pro Thr Lys Gin Gin Lys Val Asp Tyr Asn Val Thr Pro Asn Phe 
50 55 60 

Val Asp His Pro Ser Thr Val Gin Ala lie. Gin Glu Gin Thr Pro Val 
65 70 - 75 80 

Ser Ser Thr Lys Pro Thr Glu Val Gin Val Val Glu Lys Pro Phe Ser 

. 85 90 ' 95 



Thr Glu Leu lie Asn Pro Arg Lys Glu Glu Lys-Gln Ser Ser Asp Ser 
100 105 :110 
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Gin Glu Gin Leu Ala Glu His Lys Asn Leu Glu Thr Lys Lys Glu Glu 

115 120. 125 

Lys lie Ser Pro Lys Glu Lys Thr. Gly -Val Asn Thr Leu Asn Pro Gin 
130 * 135 . 140 

Asp Glu Val Leu Ser Gly Gin Leu Asn Lys Pro Glu Leu Leu Tyr Arg 
145 150 155 160. 

Glu Glu Thr Met Glu Thr Lys lie Asp Phe Gin g!u Glu lie Gin Glu 
165 170 175 

Asn Pro Asp Leu Ala Glu Gly Thr Val Arg Val Lys Gin Glu Gly Lys 
180 185 190 

Leu Gly Lys Lys Val Glu lie Val Arg lie Phe Ser Val Asn Lys Glu ' 

195 200 205 

Glu Val Ser Arg Glu lie Val Ser Thr Ser Thr Thr Ala Pro Ser Pro 
210 215 220 

Arg lie Val Glu Lys Gly Thr Lys Lys Thr Gin Val lie Lys Glu Gin 
225 230 235 - 240 

Pro Glu Thr Gly Val Glu His Lys Asp Val Gin Ser Gly Ala lie Val 
245 250 255 

Glu Pro Ala lie Gin Pro Glu Leu Pro Glu. Ala Val Val Ser Asp Lys 
260 265. 2.7 0 

Gly Glu Pro, Glu Val Gin Pro Thr Leu Pro Glu Ala Val Val Thr. Asp 

275 280 ■ ^ 285 

Lys Gly Glu Thr Glu Val Gin Pro Glu Ser Pro Asp Thr Val Val Ser 

290 295 ■' - 300 

Asp Lys Gly Glu Pro Glu Gin Val Ala Pro Leu Pro Glu Tyr Lys Gly 
305 310 315 320 

Asn lie Glu Gin Val Lys Pro Glu Thr Pro Val Glu Lys Thr Lys Glu 
325 330 335 

Gin Gly Pro Glu Lys Thr Glu Glu Val Pro Val Lys Pro Thr Glu Glu 
340 345 ^ 350 

Thr Pro Val Asn Pro Asn Glu Gly. Thr Thr Glu Gly Thr Ser lie Gin 

355 360 365 

Glu- Ala Glu Asn Pro Val Gin Pro Ala Glu Glu Ser Thr Thr Asn Ser 
370 375 380 

Glu Lys Val Ser Pro Asp Thr Ser Ser Lys Asn Thr Gly Glu Val Ser 
385 390- 395 400 

Ser Asn Pro Ser Asp Ser Thr Thr Ser Val Gly Glu Ser Asri Lys Pro 
405 . 410 . 415 

Glu His Asn Asp Ser Lys Asn Glu Asn Ser Glu Lys Thr Val Glu Glu 
420 425 .430 



Val Pro Val Asn Pro Asn Glu Gly Thr Val Glu Gly Thr Ser Asn-Gln 
' 435 440 .445 



118 



Glu Thr Glu Lys Pro Val Gin Pro Ala Glu Glu Thr Gin Thr Asn Ser 

450 455 460 

Gly Lys lie Ala Asn Glu Asn Thr Gly Glu Val Ser Asn Lys Pro Ser 

465 470 475 : 480 

Asp Ser Lys Pro Pro Val Glu Glu Ser Asn Gin Pro Glu Lys Asn Gly 

485 490 495 

Thr Ala Thr .Lys Pro Glu Asn Ser Gly Asn Thr Thr. Ser Glu Asn Gly 

500 .505 510 

Gin Thr Glu Pro Glu Pro Ser Asn Gly Asn Ser Thr Glu Asp Val Ser 

515 520 , 525 

Thr' Glu Ser Asn Thr Ser Asn Ser Asn Gly Asn Glu Glu lie Lys Gin 

530 535 . 540 

Glu Asn Glu Leu Asp Pro Asp Lys Lys Val Glu Glu Pro Glu Lys Thr 

5.45 - 550 555 560 

Leu Glu Leu Arg Asn Val Ser Asp Leu Glu Leu 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

TGAGAATCAA GCTACACCCA AAGAGACTAG CGCTCAAAAG ACAATCGTCC TTGCTACAGC 60 

TGGCGACGTG CCACCATTTG ACTACGAAGA CAAGGGCAAT CTGACAGGCT TTGATATCGA 12 0 

AGTTTTAAAG GCAGTAGATG AAAAACTCAG CGACTACGAG ATTCAATTCC AAAGAACCGC 18 0 

CTGGGAGAGC ATCTTCCCAG GACTTGATTC TGGTCACTAT CAGGCTGCGG CCAATAACTT 240 

GAGTTACACA AAAGAGCGTG CTGAAAAATA CCTTTACTCG CTTC CAATTT CCAACAATCC 3 00 

CCTCGTCCTT GTCAGCAACA AGAAAAATCC TTTGACTTCT CTTGACCAGA TCGCTGGTAA 3 60 

AACAACACAA GAGGATACCG GAACTTCTAA CGCTCAATTC ATCAATAACT GGAATCAGAA . 420 

ACACACTGAT AATCCCGCTA CAATTAATTT TTCTGGTGAG GATATTGGTA. AACGAATCCT 480 

AGACCTTGCT AACGGAGAGT TTGATTTCCT AGTTTTTGAC AAGGTATCCG TTCAAAAGAT 540 

TATCAAGGAC CGTGGTTTAG ACCTCTCAGT CGTTGATTTA CCTTCTGCAG ATAGCCCCAG 600 

CAATTATATC ATTTTCTCAA GCGACCAAAA AGAGTTTAAA GAGCAATTTG ATAAAGCGCT 660 

CAAAGAACTC TATCAAGACG GAACCCTTGA AAAACTCAGC AATACCTATC TAGGTGGTTC 720 

TTACCTCCCA GATCAATCTC AGTTACAA 748 
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(2) INFORMATION FOR SEQ ID NO : 6 : 

(i)- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 amino acids 

(B) TYPE: amino -acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY.: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Glu Asn Gin Ala Thr Pro Lys Glu Thr Ser Ala Gin' Lys Thr lie Val 
1 .5 10 , 15 

Leu Ala Thr Ala Gly- Asp Val Pro Pro Phe Asp Tyr Glu Asp Lys Gly 

20 25 " 30- 

Asn Leu Thr Gly Phe Asp lie Glu Val Leu Lys Ala Val Asp Glu Lys 
35 40 45 

Leu Ser Asp Tyr Glu lie Gin Phe Gin Arg Thr Ala Trp Glu Ser lie 
50 '55 60 

Phe Pro Gly Leu Asp Ser Gly. His Tyr Gin Ala Ala Ala Asn Asn Leu 
65 70 75 80 

Ser Tyr Thr Lys Glu Arg Ala Glu Lys Tyr Leu Tyr Ser Leu Pro lie 
.85 90 .95 

Ser Asn Asn Pro Leu Val Leu Val Ser Asn Lys Lys Asn Pro Leu' Thr 
100 '105 ' 110 

Ser Leu- Asp Gin lie Ala Gly Lys Thr Thr Gin Glu Asp Thr Gly Thr 
115 - 120 125 

Ser Asn Ala Gin Phe lie Asn Asn Trp Asn Gin Lys His Thr Asp Asn 
130 135 140 . 

Pro Ala Thr lie Asn Phe Ser Gly Glu Asp lie Gly Lys Arg lie Leu 
145 150 155 160 

Asp Leu Ala Asn Gly Glu Phe Asp Phe Leu Val Phe Asp Lys Val Ser 
165 170 175 

Val Gin Lys lie lie Lys Asp Arg Gly Leu Asp Leu Ser Val Val Asp 
180 185 ' - 190 

Leu Pro Ser Ala Asp Ser Pro Ser Asn Tyr lie lie Phe Ser Ser Asp 
195 200 . 205 

Gin Lys Glu Phe Lys Glu Gin Phe Asp Lys Ala Leu Lys Glu L,eu Tyr 
210 215 220 

Gin Asp Gly Thr Leu Glu Lys Leu Ser Asn Thr Tyr Leu Gly Gly Ser 

225 . > .... 230 . • ... 235 \ 240 

Tyr Leu Pro Asp Gin Ser Gin Leu Gin 

245 



(2) INFORMATION FOR SEQ ID NO : 7; 



^ 3 



; :r3=. ■ 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 5 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

~(xi) , SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TGGTAACCGC TCTTCTCGTA ACGCAGCTTC ATCTTCTGAT .GTGAAGACAA AAGCAGCAAT 60 

CGTCACTGAT ACTGGTGGTG TTGATGACAA ATCATTCAAC CAATCAGCTT GGGAAGGTTT 120 

GCAGGCTTGG GGTAAAGAAC ACAATCTTTC AAAAGATAAC GGTTTC AC TT ACTTCCAATC 180 

AACAAGTGAA GCTGACTACG CTAACAACTT GCAACAAGCG GCTGGAAGTT' ACAACCTAAT 240 

CTTCGGTGTT GGTTTTGCCC TTAATAATGC AGTTAAAGAT GCAGCAAAAG AACACACTGA ■ 3 00 

CTTGAACTAT GTCTTGATTG ATGATGTGAT TAAAGACCAA AAGAATGTTG CGAGCGTAAC 3 60 

TTTC GCTGAT AATGAGTCAG GTTACCTTGC AGGTGTGGCT GCAGCAAAAA CAACTAAGAC 42 0 

AAAACAAGTT GGTTTTGTAG GTGGTATCGA ATCTGAAGTT ATCTCTCGTT TTGAAGCAGG 48 0 

ATTCAAGGCT GGTGTTGCGT CAGTAGACCC ATCTATCAAA GTCCAAGTTG ACTACGCTGG 540 

TTCATTTGGT GATGCGGCTA AAGGTAAAAC AATTGCAGCC GCACAATACG CAGCCGGTGC 600 

AGATATTGTT TACCAAGTAG CTGGTGGTAC AGGTGCAGGT GTCTTTGCAG AGGCAAAATC 660 

TCTCAACGAA AGCCGTCCTG AAAATGAAAA AGTTTGGGTT ATCGGTGTTG ATCGTGACCA 720 

AGAAGCAGAA* GGTAAATACA CTTCTAAAGA TGGCAAAGAA TCAAACTTTG TTCTTGTATC .780 

TAC TTTG AAA CAAGTTGGTA CAACTGTAAA AGATATTTCT AACAAGGCAG AAAGAGGAGA 840 

ATTCCCTGGC GGTCAAGTGA TCGTTTACTC ATTGAAGGAT AAAGGGGTTG AC TTGGC AGT 900 

AACAAACCTT TCAGAAGAAG GTAAAAAAGC TGTCGAAGAT GCAAAAGCTA AAATCCTTGA 960 

TGGAAGCGTA AAAGTTCCTG AAAAA 985 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 8 amino ' acids ' v 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: protein 
(xij SEQUENCE DESCRIPTION: SEQ ID NO : 8 : . 

Gly Asn Arg Ser Ser Arg Asn Ala* Ala Ser Ser Ser^Asp Val Lys Thr 
1 5 10 •" ' . 15 

Lys Ala Ala lie Val Thr Asp Thr G ly Gly Val Asp Asp Lys Ser Phe 

. 20 • 25 „_„ . .3 0 

Asn Gin Ser Ala Trp Glu Gly Leu Gin Ala Trp Gly Lys Glu His Asn 
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35 40 45 

Leu Ser Lys Asp Asn Gly Phe Thr Tyr Phe Gin Ser Thr Ser Glu Ala 
50 55 60 



■ Asp Tyr Ala Asn Asn Leu Gin Gin Ala Ala Gly Ser Tyr Asn Leu lie 
65 . ^ 70 75 80 

Phe Gly Val Gly Phe Ala Leu Asn Asn Ala Val Lys Asp Ala Ala Lys 
85 90 95 

Glu His Thr Asp Leu Asn Tyr Val Leu He Asp Asp Val He Lys Asp 
100 105 110 

Gin Lys Asn Val Ala Ser Val Thr Phe Ala Asp Asn Glu Ser Gly Tyr 
115 ' 120 125 

. Leu Ala Gly' Val Ala Ala Ala' Lys Thr Thr Lys Thr Lys Gin Val Gly 
130 135 140 

Phe Val Gly"Gly" lie Glu Ser Glu Val He Ser Arg Phe Glu Ala Gly 
145 150 155 160 

Phe Lys Ala Gly Val Ala Ser Val Asp Pro Ser He Lys Val Gin Val 
165 170 175 

Asp Tyr Ala Gly Ser Phe Gly Asp Ala Ala Lys Gly Lys Thr He Ala 
180 185 190 

Ala Ala Gin Tyr Ala Ala Gly- Ala Asp He Val Tyr Gin Val Ala Gly 
195 200 205 

Gly Thr Gly Ala Gly Val Phe Ala Glu Ala Lys Ser Leu Asn Glu Ser 
210 215 220 

Arg Pro Glu Asn Glu Lys. Val Trp Val He Gly Val Asp Arg Asp Gin 
,225 230 235 240 

Glu Ala Glu Gly Lys Tyr Thr Ser Lys Asp Gly Lys Glu Ser Asn Phe 
245 ^250 < 255 

Val Leu Val Ser Thr Leu Lys Gin Val Gly Thr Thr Val Lys Asp He 
260 265 ' 270 

Ser Asn Lys Ala Glu Axg Gly Glu Phe Pro Gly Gly Gin Val lie Val 
275 280 285 

Tyr Ser Leu Lys Asp Lys Gly Val Asp Leu Ala Val Thr Asn Leu Ser 
290 295 300 

Glu Glu Gly Lys Lys Ala Val Glu Asp Ala Lys Ala Lys He Leu Asp 

305 . 310 • 315 ■ 320 

-Gly Ser Val Lys Val Pro Glu Lys 

' 325 , 

(2) INFORMATION FOR SEQ ID NO: .9: 

(i) SEQUENClTcHARACTERISTICS : 

— (A) LENGTH: 1404 base pairs . — ^- — . ~ 

-- — — .-(B) TYPE: nucleic acid - " 

(C) STRANDEDNESS : double ■ - 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: ' 

TGTGGAAATT TGACAGGTAA CAGCAAAAAA GCTGCTGATT CAGGTGACAA AC C TGTTATC 60 

AAAATGT AC C AAATC GGTGA C AAAC C AG AC AACTTGGATG AATTGTTAGC AAATGCCAAC 120 

AAAATCATTG AAGAAAAAGT TGGTGCCAAA TTGGATATCC AATACCTTGG CTGGGGTGAC 180 

TATGGTAAGA AAATGTCAGT TATCACATCA TCTGGTGAAA ACTATGATAT TGC CTTTGCA 240 

GATAACTATA TTGTAAATGC TCAAAAAGGT GCTTACGCTG ACTTGACAGA ATTGTACAAA - 3 00 

AAAGAAGGTA AAG AC CTTTA CAAAGCACTT GACCCAGCTT ACATCAAGGG TAATACTGTA 3 60 

AATGGTAAGA TTTACGCTGT TCCAGTTGCA GCCAACGTTG CATCATCTCA AAACTTTGCC 420 

TTCAACGGAA CTCTCCTTGC TAAATATGGT ATCGAT ATTT CAGGTGTTAC TTCTTACGAA - 480 

AC TC TTGAGC CAGTCTTGAA ACAAATCAAA GAAAAAGCTG CAGACGTAGT ACCATTTGCT 540 

ATTGGTAAAG TTTTCATCCC ATC TGATAAT TTTGAC TACC CAGTAGCAAA CGGTCTTCCA 600 

TTCGTTATCG ACCTTGAAGG CGATACTACT AAAGTTGTAA ACCGTTACGA AGTGCCTCGT 660 

TTCAAAGAAC ACTTGAAGAC TCTTCACAAA TTCTATGAAG . CTGGCTACAT TCCAAAAGAC 720 

GTCGCAACAA GCGATACTTC CTTTGACCTT CAACAAGATA CTTGGTTCGT TCGTGAAGAA 780 

ACAGTAGGAC CAGCTGACTA CGGTAACAGC TTGCTTTCAC GTGTTGCCAA CAAAGATATC 840 

CAAATCAAAC CAATTACTAA CTTCATCAAG NAAAACCAAA CAACACAAGT TGCTAACTTT : 900 
GTCATCTCAA ACAACTCTAA GAACAAAGAA AAATCAATGG AAATCTTGAA CCTCTTGAAT - 960 

ACGAAC C C AG AACTCTTGAA . CGGTCTTGTT TAC GGTCC AG AAGGCAAGAA. CTGGGAAAAA 1020 
ATTGAAGGTA AAGAAAACCG TGTTCGCGTT CTTGATGGCT ACAAAGGAAA CACTCACATG " " 1080 

GGTGGATGGA ACACTGGTAA CAACTGGATC CTTTACATCA ACGAAAACGT TACAGACCAA 1140 

C AAATC G AAA ATTCTAAGAA AGAATTGGCA GAAGCTAAAG AATCTCCAGC GCTTGGATTT 1200. 

ATCTTCAATA CTGACAATGT GAAATCTGAA ATCTCAGCTA TTGC TAAC AC AATGCAACAA 1260 

TTTGATACAG CTATCAACAC TGGTACTGTA GACCCAGATA AAGCGATTCC AGAATTGATG 1320 

GAAAAATTGA AATCTGAAGG TGCCTACGAA AAAGTATTGA ACGAAATGCA AAAACAATAC 13 80 

GATGAATTCT TGAAAAACAA AAAA " . 1404 



(2) INFORMATION FOR SEQ ID NO:10": 

( i ) SEQUENCE .CHARACTERISTICS : 

(A) LENGTH: 468 amino acids 
' (B) TYPE: amino acid \ 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii)' MOLECULE TYPE —protein _ 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

, Cys Gly Asn Leu Thr Gly Asn Ser Lys Lys Ala Ala Asp Ser Gly Asp 

• 1 - 5 - 10 . .. ■ 15 

Lys Pro Val lie Lys Met Tyr Gin- lie Gly Asp Lys Pro Asp Asn Leu 

20 . 25 30 

Asp Glu Leu Leu Ala Asn Ala Asn Lys lie lie Glu Glu Lys Val Gly 
35 40 45 

^ Ala Lys Leu Asp He Gin Tyr Leu Gly Trp Gly Asp Tyr Gly Lys Lys 
50 .55 60 

Met Ser Val He Thr Ser Ser Gly Glu Asn Tyr Asp He Ala Phe Ala 
65 70 75 80 

Asp Asn Tyr He Val Asn Ala Gin Lys Gly Ala Tyr Ala Asp Leu Thr 
85 . 90 95 

Glu Leu Tyr Lys Lys Glu Gly"" Lys "Asp Leu"Tyr~'Lys~Ala Leu Asp Pro 
100 105 ■ 110 

Ala Tyr He Lys Gly Asn Thr Val Asn Gly Lys lie Tyr Ala Val Pro 
115 120 125 . 

Val Ala Ala Asn Val Ala Ser Ser Gin Asn Phe Ala Phe Asn Gly Thr 
130 135. 140 

Leu Leu Ala Lys Tyr Gly. He Asp lie Ser Gly Val Thr Ser Tyr Glu 
•145 - ; 150 155- 160 

Thr Leu Glu Pro Val Leu Lys Gin He Lys Glu Lys Ala Pro Asp Val 
. 165 170 175 

Val Pro Phe Ala He Gly Lys Val Phe He Pro Ser Asp Asn Phe Asp 
180 ' . . 185 * 190 

Tyr Pro Val Ala Asri Gly Leu Pro Phe Val He Asp Leu Glu Gly Asp 
195 200 '205 

Thr Thr. Lys Val Val Asn Arg Tyr Glu Val Pro Arg Phe Lys Glu His 
210 215 220 

Leu Lys> Thr Leu His Lys, Phe Tyr Glu Ala Gly Tyr He Pro Lys Asp 
225 * 230 235 240 

Val Ala Thr Ser Asp. Thr Ser Phe Asp Leu Gin Gin Asp Thr. Trp Phe 
245 250 255 

Val Arg Glu Glu Thr Val Gly Pro Ala Asp Tyr Gly Asn Ser Leu Leu 
260 265 . 270 

Ser Arg Val Ala Asn Lys Asp lie Gin He Lys Pro He Thr Asn Phe 
275 280 285 

lie Lys Xaa Asn Gin Thr Thr Gin Val Ala Asn Phe Val lie Ser Asn 
290 295 300_ 

Asn Ser Lys Asn Lys Glu Lys Ser Met Glu lie" Leu Asn Leu Leu Asn 

305. . • 310 : 315 ' --— 320 



Thr Asn Pro Glu Leu Leu Asn Gly Leu Val Tyr Gly Pro Glu Gly Lys 



-4 
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325 330 335 

Asn Trp Glu Lys lie Glu Gly Lys Glu Asn Arg Val Arg Val Leu Asp 

340 345 --- 350 - 



Gly Tyr Lys Gly Asn Thr His Met Gly Gly Trp Asn Thr Gly Asn Asn 

355 360 365 

Trp lie Leu Tyr lie Asn Glu Asn Val Thr Asp Gin Gin lie Glu Asn 
370 375 380 

Ser Lys Lys Glu Leu Ala Glu Ala Lys Glu Ser Pro Ala Leu Gly Phe 
385 390 395 400 

lie Phe Asn Thr Asp Asn Val Lys Ser Glu lie Ser Ala lie Ala Asn 
405 410 " 415 

.Thr Met Gin Gin Phe Asp Thr Ala lie Asn Thr Gly Thr Val Asp Pro 

420 425 430 

Asp Lys Ala lie Pro Glu Leu Met Glu Lys Leu Lys" Ser Glu~Gly~Ala""" 
435 440 445 



Tyr Glu Lys Val Leu Asn Glu Met Gin Lys Gin Tyr Asp Glu Phe Leu 

IJ1 450 455 460 

L~i Lys Asn Lys Lys 

*L (2) INFORMATION FOR SEQ ID NO : 11 : 

P ' ' 

1^ (i) SEQUENCE CHARACTERISTICS: ' . 

W (A) LENGTH: 937 base pairs 

jjy - (B) TYPE: nucleic acid 

p (C) STRANDEDNESS: double 

n (D) TOPOLOGY: linear * ' 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TGGTCAAGGA ACTGC TTCTA AAGACAACAA AGAGGCAGAA CTTAAGAAGG TTGACTTTAT 60 

CCTAGACTGG ACACCAAATA CCAACCACAC AGGGCTTTAT GTTGCCAAGG AAAAAGGTTA 120 

TTTCAAAGAA GCTGGAGTGG ATGTTGATTT GAAATTGCCA CCAGAAGAAA GTTCTTCTGA 180 

CTTGGTTATC AAC GGAAAGG C AC CATTTGC AGTGTATTTC CAAGACTACA TGGCTAAGAA 240 

ATTGGAAAAA GGAGCAGGAA TC ACTGC C GT TGCAGCTATT GTTGAACACA ATACATCAGG 3 00 

AATCATCTCT CGTAAATCTG ATAATGTAAG CAGTCCAAAA G AC TTGGTTG GTAAGAAATA 3 60 

TGGGACATGG AATGACCCAA CTGAACTTGC TATGTTGAAA ACC TTGGT AG AATCTCAAGG" 420 

TGGAGACTTT GAGAAGGTTG AAAAAGTACC AAATAACGAC TCAAACTCAA TC AC AC C GAT 480 

TGCCAATGGC GTCTTTGATA CTGCTTGGAT TTACTACGGT TGGGATGGTA TCCTTGCTAA 540 
ATCTCAAGGT GTAGATGCTA ACTTCATGTA CTTGAAAGAC TATGTCAAGG AGTTTGACTA "~600 

CTATTCACCA GTTATCATCG CAAACAACGA CTATCTGAAA GATAACAAAG --AAGAAGCTCG 660- 



CAAAGTCATC C AAGC CATC A AAAAAGGCTA CCAATATGCC ATGGAACATC CAGAAGAAGC '720 
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TGCAGATATT CTCATCAAGA ATGCACCTGA ACTCAAGGAA AAACGTGACT TTGTCATCGA 78 0 : 

ATCTCAAAAA TACTTGTCAA AAGAATACGC AAGCGACAAG GAAAAATGGG GTCAATTTGA 840 

CGCAGCTGGC TGGAATGCTT TCTACAAATG GGATAAAGAA AATGGTATCC TTAAAGAAGA 900 

CTTGACAGAC AAAGGCTTCA CCAACGAATT TGTGAAA 937 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 
. (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein J 
^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

SI Gly Gin Gly Thr Ala Ser Lys Asp Asn Lys Glu Ala Glu Leu Lys Lys 

Ol 1 5 10 15 

! 

m Val Asp Phe He Leu Asp Trp Thr Pro Asn Thr Asn His Thr Gly Leu 

20 25 30. \ ■ 

3-88 Tyr Val Ala Lys Glu Lys Gly Tyr Phe Lys Glu Ala Gly Val Asp Val 

!L 35 40-45 

Asp Leu Lys Leu Pro Pro Glu Glu Ser Ser Ser Asp Leu Val He Asn 

!y . 50 55 \ 60 

% y - * 

:;~5 Gly Lys Ala Pro Phe Ala Val Tyr Phe Gin Asp Tyr Met Ala Lys Lys 

J!! 65 .70 75 .80 

j^eu Glu Lys Gly Ala Gly He Thr Ala Val Ala Ala He Val Glu His 
85 90 95 

Asn Thr Ser Gly lie He Ser Arg Lys Ser Asp Asn Val Ser Ser Pro 
' 100 105 HO 

Lys Asp Leu Val Gly Lys Lys Tyr Gly Thr Trp Asn Asp Pro Thr Glu 
115 ' 120 125 

Leu Ala Met Leu Lys Thr Leu Val Glu Ser Gin Gly Gly Asp Phe Glu 
130 135 140 

Lys Val Glu Lys Val Pro Asn Asn Asp Ser Asn Ser lie thr Pro He 
145 150 . 155 160 

Ala Asn Gly Val Phe Asp Thr Ala Trp lie Tyr Tyr Gly Trp Asp Gly 
165 170 175 

He Leu Ala Lys Ser Gin Gly Val Asp Ala Asn Phe Met Tyr Leu Lys 
180 . 185 ' 190 

Asp Tyr Val Lys Glu "Phe Asp Tyr Tyr Ser Pro Val. He lie Ala Asn 
195 200 ' 205 



Asn Asp Tyr Leu Lys , Asp Asn Lys Glu Glu Ala Arg Lys Val He Gin 
210 215 . 220 . 
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3 Hs* 



Ala lie Lys Lys Gly -Tyr Gin Tyr Ala Met Glu His Pro Glu Glu Ala 

225 230 235 240 

_Ala Asp lie Leu lie Lys Asn Ala Pro Glu Leu Lys. .Glu Lys Arg Asp 
245 250 255. 

Phe Val lie Glu Ser Gin Lys Tyr Leu Ser Lys Glu Tyr Ala Ser Asp 
260 265 ' 270 

Lys Glu Lys Trp Gly Gin Phe Asp Ala Ala Arg Trp Asn Ala Phe Tyr 
275 280 285 

Lys Trp Asp Lys Glu Asn Gly lie Leu Lys Glu Asp Leu Thr Asp Lys 
290 295 300 

Gly Phe Thr Asn Glu Phe Val Lys 

305 * 310 

(2) INFORMATION FOR SEQ ID NO: 13: ■ - . 



^ (i) SEQUENCE CHARACTERISTICS: " " * . 
^ (A) LENGTH: 799 base pairs 
1=M (B) TYPE: nucleic acid 
iH (C) STRANDEDNESS : double 
fy . (D) TOPOLOGY: linear 

*~ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

l^j TAGCTCAGGT GGAAACGCTG GTTCATCCTC TGGAAAAACA ACTGCCAAAG CTCGCACTAT 60 

CGATGAAATC AAAAAAAGCG GTGAACTGCG AATCGCCGTG TTTG GAG ATA AAAAACCGTT 12 0 

TGGCTACGTT GACAATGATG GTTCTACCAA GGTACGCTAC GATATTGAAC TAGGGAACCA 180 

ACTAGCTCAA GACCTTGGTG TCAAGGTTAA ATACATTTCA GTCGATGCTG CCAACCGTGC 240 

GGAATACTTG ATTTCAAACA AGGTAGATAT TACTCTTGCT AACTTTACAG TAACTGACGA 300 

ACGTAAGAAA C AAGTTG ATT TTGCCCTTCC ATATATGAAA. GTTTCTCTGG GTGTCGTATC ■ 3 60 

ACCTAAGACT GGTCTCATTA CAGACGTCAA ACAACTTGAA GGTAAAACCT TAATTGTCAC 420 

AAAAGGA 3 ACTGCTGAGA CTTATTTTGA AAAGAATCAT CCAGAAATCA AACTCCAAAA . 480 

ATACGAC CAA TACAGTGACT CTTACCAAGC TCTTCTTGAC GGAC GTGG AG ATGCCTTTTC. 540 

AACTGACAAT ACGGAAGTTC TAGCTTGGGC GCTTGAAAAT AAAGGATTTG AAGTAGGAAT 600 

TACTTCCCTC GGTGATCCCG AT AC C ATTGC GGCAGCAGTT CAAAAAGGCA ACCAAGAATT 660 

GCTAGACTTC ATCAATAAAG ATATTGAAAA ATTAGGCAAG GAAAACTTCT TCCACAAGGC . 720 

CTATGAAAAG ACACTTCACC CAACCTACGG TGACGCTGCT AAAGCAGATG ACCTGGTTGT 780 

TGAAGGTGGA AAAGTTGAT 799 

^ (2) INFORMATION FOR SEQ ID NO: 14: ^ 

(i) SEQUENCE CHARACTERISTICS: * ^ 

(A) LENGTH: 266 amino acids' 

. (B) TYPE: amino acid '' 
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( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:14: 

Ser Ser Gly Gly Asn Ala Gly Ser Ser Ser Gly Lys Thr Thr Ala Lys 
1 5 10 15 

Ala Arg Thr lie Asp Glu lie Lys Lys Ser Gly Glu Leu Arg lie Ala 

20 .25 30 

Val Phe Gly Asp Lys Lys Pro Phe Gly Tyr Val Asp Asn Asp Gly Ser 
35 40 45 

Thr Lys Val Arg Tyr Asp. lie Glu Leu Gly Asn Gin Leu Ala Gin Asp 
50 55 60 

Leu Gly Val Lys Val Lys Tyr lie Ser Val Asp Ala Ala Asn Arg Ala 
65 70 75 80 

Glu Tyr Leu lie Ser Asn Lys Val Asp lie Thr Leu Ala Asn Phe Thr 
85 90 95 

Val Thr Asp Glu Arg Lys Lys Gin Val Asp Phe Ala Leu Pro Tyr Met 
100 105 110 

Lys Val Ser Leu Gly Val Val Ser Pro Lys Thr Gly Leu lie Thr Asp 
115 120 125 

Val Lys Gin Leu Glu Gly Lys Thr Leu lie Val Thr Lys Gly Thr Thr 
130 135 140 ' 

Ala Glu Thr Tyr Phe Glu Lys Asn His Pro Glu lie Lys Leu Gin Lys 
145 150 155 160 

Tyr Asp Gin Tyr Ser Asp Ser Tyr Gin Ala Leu Leu Asp Gly Arg Gly 
165 170 175 

Asp Ala Phe Ser Thr Asp Asn Thr Glu Val Leu Ala Trp Ala Leu Glu 
180 185 190 

Asn Lys Gly Phe Glu Val Gly lie Thr Ser Leu Gly Asp Pro Asp Thr 
195 . 200 205 

lie Ala. Ala Ala Val Gin Lys Gly Asn Gin Glu Leu Leu Asp Phe lie 
210 215 220 

Asn Lys Asp lie Glu Lys Leu Gly Lys Glu Asn Phe Phe His Lys Ala 
225 230 235 240 

•Tyr Glu Lys Thr Leu His Pro Thr Tyr Gly Asp Ala Ala Lys Ala Asp 
245 250 255 

Asp Leu Val Val Glu Gly Gly Lys t Val Asp 
260 1 265 



(2-) — INFORMATION FOR SEQ ID NO: 15: 

- - (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1189 base pairs 
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{ B) TYPE: nucleic acid 
- (C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 15 : 

CTCCAACTAT GGTAAATCTG CGGATGGCAC AGTGAC CATC GAGTATTTCA AC C AG AAAAA 60 

AGAAATG AC C AAAACCTTGG AAGAAATCAC TCGTGATTTT GAGAAGGAAA ACCCTAAGAT 120 

CAAGGTCAAA GTCGTCAATG TACCAAATGC TGGTGAAGTA TTGAAG AC AC GCGTTC TCGC 18 0 

AGGAGATGTG CCTGATGTGG TCAATATTTA CCCACAGTCC ATCGAACTGC AAGAATGGGC 240 

AAAAGCAGGT GTTTTTGAAG ATTTGAGCAA CAAAGACTAC CTGAAACGCG TGAAAAATGG 300 

CTACGCTGAA AAATATGCTG TAAACGAAAA AGTTTACAAC GTTCCTTTTA CAGCTAATGC 360 

Q TTATGGAATT TACTACAACA AAGATAAATT C GAAGAACTG GGCTTGAAGG TTCCTGAAAC 42 0 

C\ CTGGGATGAA TTTGAACAGT TAGTCAAAGA TATCGTTGCT AAAGGACAAA CACCATTTGG 480 

't! ' AATTGCAGGT GCAGATGCTT GGACACTCAA TGGTTACAAT CAATTAGCCT TTGCGACAGC 540 

IU AACAGGTGGA GGAAAAGAAG CAAATCAATA CCTTCGTTAT TCTCAACCAA ATGCCATTAA 600 

: iuA ATTGTC GGAT CCGATTATGA AAGATGATAT CAAGGTCATG GACATCCTTC GCATCAATGG 660 

g ATCTAAGCAA AAGAACTGGG AAGGTGCTGG CTATACCGAT GTTATCGGAG CCTTCGCACG 72 0 

!!! TGGGGATGTC CTCATGACAC CAAATGGGTC TTGGGC GATC ACAGCGATTA ATGAACAAAA 780 

VU ACCGAAGTTT AAGATTGGGA CCTTCATGAT TCCAGGAAAA GAAAAAGGAC AAAGCTTAAC 840 

y~ CGTTGGTGCG GGAGAC TTGG C ATGGTC TAT CTCAGCCACC ACCAAACATC CAAAAGAAGC 900. 

CAATGCCTTT GTGGAATATA TGACCCGTCC AGAAGTCATG CAAAAATACT AC GATGTGGA 960 

CGGATCTCCA ACAGCGATCG AAGGGGTCAA ACAAGCAGGA GAAGATTCAC CGCTTGCTGG 1020 

TATGACCGAA TATGCCTTTA CGGATCGTCA C TTGGTCTGG TTGCAACAAT ACTGGACCAG 1080 

TGAAGCAGAC TTCCATACCT TGACCATGAA CTATGTCTTG ACCGGTGATA AACAAGGCAT 1140 

GGTCAATGAT TTGAATGCCT TCTTTAACCC GATGAAAGCG GATGTGGAT , 118 9. 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 96 amino acids 

(B) TYPE:, amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

^(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ' ID- NO : 16 : 

Ser Asn Tyr.Gly Lys Ser Ala Asp Gly Thr Val Thr lie Glu Tyr Phe - > 

1 5 . .. 10 15 
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Asn Gin Lys Lys Glu Met Thr Lys Thr Leu Glu Glu lie Thr Arg Asp 
20 25 30 

Phe Glu Lys Glu Asn Pro Lys lie Lys Val Lys Val Val Asn Val Pro- 

_ —35 , 40 45 

Asn Ala Gly Glu Val Leu Lys Thr Arg Val Leu Ala Gly Asp Val Pro 
50 55 60 

Asp Val Val Asn lie Tyr Pro Gin Ser lie Glu Leu Gin Glu Trp Ala 
65 70 75 80 

Lys Ala Gly Val Phe Glu Asp Leu Ser Asn Lys Asp_ Tyr Leu Lys Arg 
85 90 95 

Val Lys Asn Gly Tyr Ala Glu Lys Tyr Ala Val Asn Glu Lys Val Tyr 
100 105 110 

Asn Val Pro Phe Thr Ala Asn Ala Tyr Gly lie Tyr Tyr Asn Lys Asp 

115 120 125 



Lys Phe Glu Glu Leu Gly Leu .Lys Val Pro Glu Thr Trp Asp Glu Phe 
130 135 140 

Glu Gin Leu Val Lys Asp lie Val Ala Lys Gly Gin Thr Pro Phe Gly 
145 150 155 160 

lie Ala Gly Ala Asp Ala Trp Thr Leu Asn Gly Tyr Asn Gin Leu Ala 
165 170 175 

Phe Ala .Thr Ala Thr Gly Gly Gly Lys Glu Ala Asn Gin Tyr Leu Arg 
180 185 190 

Tyr Ser Gin. Pro Asn Ala lie Lys Leu Ser Asp Pro lie Met Lys Asp 
195 200 205 

Asp lie Lys Val Met Asp lie Leu Arg lie Asn Gly Ser Lys Gin Lys 
210 215 220 

Asn Trp Glu Gly;*" Ala Gly Tyr Thr Asp Val lie Gly, Ala Phe Ala Arg 
225 . 230 235 240 

Gly Asp Val Leu Met Thr Pro Asn Gly Ser Trp Ala lie Thr Ala lie 
245 . 250 255 

Asn Glu Gin Lys Pro Asn Phe Lys lie Gly Thr Phe Met lie Pro Gly 
260 265 270 

Lys Glu Lys Gly Gin Ser Leu Thr Val Gly Ala Gly Asp Leu Ala Trp- 
275 280 285 

Ser lie Ser Ala Thr Thr Lys His Pro Lys Glu Ala Asn Ala Phe Val 

290 . 295 300 

Glu Tyr Met Thr Arg Pro Glu Val Met Gin Lys Tyr Tyr Asp Val Asp 

305 310 315 -., 320 

Gly Ser Pro Thr Ala lie Glu Gly Val Lys Gin Ala Gly Glu Asp Ser 
325 330 335 

Pro Leu Ala Gly Met Thr Glu Tyr Ala Phe Thr Asp Arg- His Leu Val 
340 — — 345 " 350 





r 
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Trp Leu Gin 
355 



Gin Tyr Trp Thr Ser Glu Ala Asp Phe His Thr Leu Thr 
360 365 



Met Asn Tyr 
370 



Val Leu Thr Gly Asp Lys Gin Gly Met Val Asn Asp Leu 
375 380 



Asn Ala Phe 
385 



Phe Asn Pro Met Lys Ala Asp Val Asp 
390 395 



(2) INFORMATION FOR SEQ ID NO: 17 : . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

'(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TGGGAAAAAT TCTAGCGAAA C T AGTGGAG A TAATTGGTCA AAGTACCAGT CTAACAAGTC 6 0 

TATTACTATT GGATTTGATA GTACTTTTGT TCCAATGGGA TTTGCTCAGA AAGATGGTTC 12 0 

TTATGCAGGA TTTGATATTG ATTTAGCTAC AGCTGTTTTT GAAAAATACG GAATCACGGT 18 0 

AAATTGGCAA CCGATTGATT GGGATTTGAA AGAAGCTGAA TTGACAAAAG GAACGATTGA 240 

TCTGATTTGG AATGGCTATT CCGCTACAGA CGAACGCCGT GAAAAGGTGG CTTTCAGTAA 3 00 

CTCATATATG AAGAATGAGC AGG TATTGGT TACGAAGAAA TCATCTGGTA TCACGACTGC 3 60 

AAAGGATATQ ACTGGAAAGA CAT T AGGAGC TCAAGCTGGT TCATCTGGTT ATGCGGACTT .42 0 

TGAAGCAAAT C C AG AAATTT TGAAGAATAT TGTCGC TAAT AAGGAAGCGA ATCAATACCA 48 0 
AACCTTTAAT GAAGCCTTGA TTGATTTGAA AAACGATCGA ATTGATGGTC 1 T ATTGATTGA _ 540 

CCGTGTCTAT GCAAACTATT ATTTAGAAGC AGAAGGTGTT TTAAACGATT A-TAATGTCTT' 600 

TACAGTTGGA CTAGAAACAG AAGCTTTTGC GGTTGGAGCC CGTAAGGAAG AT AC AAAC TT 660 

GGTTAAGAAG ATAAATGAAG CTTTTTCTAG TCTTTACAAG GACGGCAAGT TCCAAGAAAT 720 

CAGCCAAAAA TGGTTTGGAG AAGATGTAGC AACCAAAGAA GTAAAAGAAG GACAG 775 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear. 

(ii) * MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

_G.ly_Lys_Asn__Ser Ser Glu Thr Ser Gly Asp Asn Trp Ser Lys Tyr .Gin 

1 5 . 10 ...... . 15 



Ser Asn Lys Ser lie Thr lie Gly Phe Asp Ser Thr Phe Val Pro Met 



20 
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30 



Gly Phe Ala Gin Lys Asp Gly Ser Tyr Ala Gly Phe Asp lie Asp Leu 

- 35 — .- - 40 45 

Ala Thr Ala Val Phe Glu Lys Tyr Gly lie Thr Val Asn Trp Gin Pro 
50 55 60 

lie Asp Trp Asp Leu Lys Glu Ala Glu Leu Thr Lys Gly Thr lie Asp 
65 .70 75 80 

Leu lie Trp Asn- Gly Tyr Ser Ala Thr Asp Glu Arg Arg Glu Lys Val 
85 90 95 



Ala Phe Ser. Asn Ser Tyr Met Lys Asn Glu Gin Val Leu Val Thr Lys 
100 105 110 

Lys Ser Ser Gly lie Thr Thr Ala Lys Asp Met Thr Gly Lys Thr Leu 
115 * 120 125 . 

Gly Ala~Gln Ala" "Gly Ser Ser Gly Tyr Ala Asp Phe Glu Ala Asn Pro 
130 135 140 

Glu lie Leu Lys Asn lie Val Ala Asn Lys Glu Ala Asn Gin Tyr Gin 
145 150 155 160 

Thr Phe Asn Glu Ala Leu lie Asp Leu Lys Asn Asp Arg He Asp Gly 
165 170 175 

Leu Leu He Asp Arg Val Tyr Ala Asn Tyr Tyr Leu Glu Ala Glu Gly 
180 185 .190 

Val Leu Asn Asp Tyr Asn Val Phe Thr Val Gly Leu Glu Thr Glu Ala 
195 200 205 

Phe Ala Val Gly Ala Arg Lys Glu Asp Thr Asn Leu Val Lys Lys He 
210 215 220 

Asn Glu Ala Phe Ser' Ser Leu Tyr Lys Asp Gly Lys Phe Gin Glu He 
225 ' 230 235 240 

Ser Gin Lys Trp Phe Gly Glu Asp Val Ala Thr Lys Glu Val Lys Glu 
245 ' 250 ; 255 

Gly Gin 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 
... (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



: .(xi) SEQUENCE^ DESCRIPTION: SEQ ID NO: 19: 
- TGCTAGCGGA AAAAAAGATA . CAACTTCTGG TCAAAAACTA- AAAGTTGTTG— CTACAAACTC 



AATCATCGCT GATATTACTA AAAATATTGC- -TGGTGACAAA ATTGACCTTC , ATAGTATC GT 
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TCCGATTGGG CAAGACCCAC AC G AATAC G A ACCACTTCCT GAAGAC GTTA AGAAAACTTC 18 Q 

TGAGGCTAAT TTGATTTTCT ATAACGGTAT CAACCTTGAA ACAGGTGGCA ATGCTTGGTT 24 0 

TACAAAATTG GTAGAAAATG CCAAGAAAAC TGAAAACAAA GACTACTTCG CAGTCAGCGA 3 00 

CGGCGTTGAT GTTATCTACC TTGAAGGTCA AAATGAAAAA GGAAAAGAAG ACCCACACGC . 3 60 

TTGGCTTAAC CTTGAAAACG GTATTATTTT TGCTAAAAAT ATCGCCAAAC AATTGAGCGC 420 

CAAAGACCCT AACAATAAAG AATTCTATGA AAAAAATCTC AAAGAATATA CTGATAAGTT 48 0 

AGACAAACTT GATAAAGAAA GTAAGGATAA ATTTAATAAG ATCCCTGCTG AAAAGAAACT 540 

CATTGTAACC AGCGAAGGAG CATTCAAATA CTTCTCTAAA GCCTATGGTG TCCCAAGTGC 600 

TTACATCTGG GAAATCAATA CTGAAGAAGA AGGAACTCCT GAACAAATCA AGACCTTGGT 660 

TGAAAAACTT CGCCAAACAA AAGTTC CATC ACTCTTTGTA GAATCAAGTG TGGATGACCG 720 
TCCAATGAAA ACTGTTTCTC AAGACACAAA CATCCCAATC ~TACGCTCAAA ~TCTTTACTGA " 780 

CTCTATCGCA GAACAAGGTA AAGAAGGCGA CAGCTACTAC AGCATGATGA AATAC AACCT 840 

TGACAAGATT GCTGAAGGAT TGGCAAAA 8 68 
(2) INFORMATION FOR SEQ ID NO: 20: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Ala Ser Gly Lys Lys Asp Thr Thr Ser Gly Gin Lys Leu Lys Val' Val 
1 5 .10 15 

Ala Thr Asn Ser lie lie Ala Asp lie Thr Lys Asn, lie Ala Gly Asp 
20 25 ' 30 

Lys lie Asp Leu His Ser lie Val Pro lie Gly Gin Asp Pro His Glu 
35 40 45 

Tyr Glu Pro Leu Pro Glu Asp Val Lys Lys Thr Ser Glu Ala Asn Leu 
50 55 60 

lie Phe Tyr Asn Gly lie Asn Leu Glu Thr Gly Gly Asn Ala Trp Phe 
65 70 75 80 

Thr Lys Leu Val Glu Asn Ala Lys Lys Thr Glu Asn Lys Asp Tyr Phe 

85 90 ~" - * 95 . 

Ala Val Ser Asp Gly Val Asp Val lie Tyr Leu Glu Gly Gin Asn Glu 
100 105 110 

Lys Gly Lys Glu Asp Pro His Ala Trp Leu Asn--Leu Glu Asn Gly Ile- 
115 - 120 " 125 
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lie Phe Ala Lys Asn lie Ala Lys Gin Leu Ser Ala Lys Asp Pro Asn 
130 , 135 140 

Asn Lys Glu Phe Tyr Glu Lys Asn Leu Lys Glu Tyr Thr Asp Lys Leu 
145 150 155 ' 160 

Asp Lys Leu Asp Lys Glu Ser Lys Asp Lys Phe Asn Lys lie Pro Ala 
165 170 175 

Glu Lys Lys Leu lie Val Thr Ser Glu Gly Ala Phe Lys Tyr Phe Ser 
180 185 190 

Lys Ala Tyr Gly Val Pro Ser Ala Tyr lie Trp Glu lie Asn Thr Glu 
195 200 205 

Glu Glu Gly Thr Pro Glu Gin lie. Lys Thr Leu Val Glu Lys Leu Arg 
210 215 220 

Gin Thr Lys Val Pro Ser Leu Phe Val Glu Ser Ser Val Asp Asp Arg 
225 - 230 235 240 

Pro Met Lys Thr Val Ser Gin Asp Thr Asn lie Pro lie Tyr Ala Gin 
245 250 255 

lie Phe Thr Asp Ser lie Ala Glu Gin Gly Lys Glu Gly Asp Ser Tyr 
260 265 270 

Tyr Ser Met Met Lys Tyr Asn Leu Asp Lys lie Ala Glu Gly Leu Ala 
275 280 285 

Lys 



(2) INFORMATION FOR SEQ ID NO : 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1546 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

TGGCTCAAAA AATACAGCTT CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CATTCCCGCT 60 

TCAAGAAAAG AAAACATTGA AGTTTATGAC AGCCAGTTCA CCGTTATCTC CTAAAGACCC 120 

AAATGAAAAG TTAATTTTGC AACGTTTGGA GAAGGAAACT GGCGTTCATA TTGACTGGAC . 180 

CAACTACCAA TCCGACTTTG C AG AAAAAC G TAACTTGGAT ATTTCTAGTG GTGATTTACC 240 

AGATGC TATC CACAACGACG GAGCTTCAGA TGTGGACTTG ATGAAC TGGG CTAAAAAAGG 3 00 

TGTTATTATT CCAGTTGAAG ATTTGATTGA TAAATACATG CCAAATCTTA AGAAAATTTT 3 60 

GGATGAGAAA CCAGAGTACA AGGCCTTGAT GACAGCACCT GATGGGCACA TTTACTCATT 420 

j.TCCATGGATT GAAGAGCTTG GAGATGGTAA AGAGTCTATT CACAGTGTCA ACGATATGGC 48 0 

TTGGATTAAC AAAGATTGGC TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CTACTGATGA . 54 0.. 
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TTTGATTAAA GTCCTAGAAG CTTTCAAAAA CGGGGATCCA AATGGAAATG GAGAGGCTGA 600 

TGAAATTCCA TTTTCATTTA TTAGTGGTAA C GGAAAC GAA GATTTTAAAT TCCTATTTGC 660 

TGCATTTGGT ATAGGGGATA ACGATGATCA TTTAGTAGTA GGAAATGATG GCAAAGTTGA 720 

CTTCACAGCA GATAACGATA ACTATAAAGA AGGTGTCAAA TTTATCCGTC AATTGCAAGA 780 

AAAAGGCCTG ATTGATAAAG AAGCTTTCGA ACATGATTGG AATAGTTACA TTGCTAAAGG 840 

TCATGATCAG AAATTTGGTG TTTACTTTAC ATGGGATAAG AATAATGT T A CTGGAAGTAA 900 

C G AAAGTT AT GAT GTTTT AC CAGTACTTGC TGGACCAAGT GGTCAAAAAC ACGTAGCTCG 960 

TACAAACGGT ATGGGATTTG CACGTGACAA GATGGTTATT ACCAGTGTAA AC AAAAAC CT 102 0 

AGAATTGACA GCTAAATGGA TTGATGCACA ATACGCTCCA CTCCAATCTG TGCAAAATAA 1080 

CTGGGGAACT TACGGAGATG ACAAACAACA AAACATCTTT GAATTGGATC AAGCGTCAAA " 1140 

TAGTCTAAAA~C AC TTACCAC TAAACGGAAC TGCACCAGCA GAACTTCGTC AAAAGACTGA 12 00 

AGTAGGAGGA CCACTAGCTA TCC TAGATTC ATACTATGGT AAAGTAACAA CCATGCCTGA 12 60 

TGATGC C AAA TGGC GTTTGG ATCTTATCAA AGAATATTAT GTTCCTTACA TGAGCAATGT 13 20 

CAATAACTAT CCAAGAGTCT TTATGACACA GGAAGATTTG GACAAGATTG CCCATATCGA 13 80 

AGCAGATATG AATG AC TATA TCTACCGTAA ACGTGCTGAA TGGATTGTAA ATGGCAATAT 1440 

TGATACTGAG . TGGGATGATT ACAAGAAAGA AC TTGAAAAA TACGGACTTT CTGATTACCT 1500 

CGC TATTAAA CAAAAATACT ACGACCAATA CCAAGCAAAC AAAAAC 1546 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: " 

(A) LENGTH: 515 amino acids - 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear. 

(ii) MOLECULE- TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Gly Ser Lys Asn Thr Ala Ser Ser Pro Asp Tyr Lys Leu Glu Gly Val 
1 5 10 15 

Thr Phe Pro Leu Gin Glu Lys Lys Thr Leu Lys Phe Met Thr Ala Ser 
20 25 30 

Ser Pro Leu Ser Pro Lys Asp Pro Asn Glu Lys Leu lie Leu Gin Arg 

35 40 45 

Leu Glu Lys Glu Thr Gly Val His lie Asp Trp Thr Asn Tyr Gin Ser 

50 55 60 . 

Asp Phe Ala .Glu Lys Arg Asn Leu Asp_Il.e_Ser_S_er_G.ly_ Asp Leu Pro 
65 "" .■-_._„ 70 .75 " - 80 

Asp Ala lie His Asn Asp Gly Ala Ser Asp Val Asp Leu Met Asn "Trp 
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85 .90 95 

Ala Lys Lys Gly Val lie He Pro Val Glu Asp Leu He Asp Lys Tyr 
100 105. - . 110 

Met Pro Asn Leu Lys Lys He Leu Asp Glu Lys Pro Glu Tyr Lys Ala 
115 120 125. 

Leu Met Thr Ala Pro Asp Gly His He Tyr Ser Phe Pro Trp He Glu 
.130 135 140 

Glu Leu Gly Asp Gly Lys Glu Ser He His - Ser Val Asn Asp Met Ala 
145 150 155 160 

Trp He Asn Lys Asp Trp Leu Lys Lys Leu Gly Leu Glu Met Pro Lys 
165 170 175 

Thr Thr Asp Asp Leu He Lys Val Leu Glu Ala Phe Lys Asn Gly Asp 
180 185 190 

Pro Asn Gly Asn Gly Glu Ala Asp Glu He Pro - Phe - Ser- Phe - He-Ser - 
195 200 205 

Gly Asn Gly Asn Glu Asp Phe Lys Phe Leu Phe Ala Ala Phe Gly He 
210 215 220 

Gly Asp Asn Asp Asp His Leu Val Val Gly Asn Asp Gly Lys Val Asp 
225 230 235 240 

Phe Thr Ala Asp Asn Asp Asn Tyr Lys Glu Gly Val Lys Phe He Arg 
245 250 255 

Gin Leu Gin Glu Lys Gly Leu He Asp Lys Glu Ala Phe- Glu His Asp 
260 - 265 270 

Trp Asn Ser Tyr He Ala Lys Gly His Asp Gin Lys Phe Gly Val Tyr 
275 280 , 285 

Phe Thr Trp Asp Lys Asn Asn Val Thr Gly Ser Asn Glu Ser Tyr Asp 
290 • 295 . 300 

Val Leu Pro Val Leu Ala Gly Pro Ser Gly Gin Lys His Val Ala Arg 
305 310 315 320 

Thr Asn Gly Met Gly Phe Ala Arg Asp Lys Met Val lie Thr Ser Val 
325 330 335 

Asn Lys Asn Leu Glu Leu Thr Ala Lys Trp He Asp Ala Gin Tyr Ala 
340 345 ' 350 

Pro Leu Gin Ser Val Gin Asn Asn Trp Gly Thr Tyr Gly Asp Asp Lys- 
355 360 • 365 

Gin Gin Asn He Phe Glu Leu Asp Gin Ala Ser Asn Ser Leu Lys His 
370 375 - ' 380 1 

Leu Pro Leu Asn Gly Thr Ala Pro Ala Glu Leu Arg Gin Lys Thr Glu 
385 390 39'5 400 



Val Gly Gly Pro Leu Ala He Leu Asp Ser Tyr Tyr Gly Lys Val Thr_ 
405 410 .I] — 415 

Thr Met Pro Asp Asp Ala Lys Trp Arg Leu Asp Leu lie .Lys Glu Tyr : 
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420 425 430 

Tyr Val Pro Tyr Met Ser Asn Val Asn Asn Tyr Pro Arg Val Phe Met 
435 440 445 ■ 

Thr Gin Glu Asp Leu Asp Lys lie Ala His lie Glu Ala Asp Met Asn 
450 455 460 

Asp Tyr lie Tyr Arg Lys Arg Ala Glu Trp lie Val Asn Gly Asn lie 
465 470 475 480 

Asp Thr Glu Trp Asp Asp Tyr Lys Lys Glu Leu Glu Lys Tyr Gly Leu 
485 490 495 

Ser Asp .Tyr Leu Ala lie Lys Gin Lys Tyr Tyr Asp Gin Tyr Gin Ala 
500 505 510 

Asn Lys Asn 
515 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi)- SEQUENCE . DESCRIPTION: SEQ ID NO: 23: 

TAGTACAAAC TCAAGCACTA GTCAGACAGA GACCAGTAGC TCTGCTCCAA CAGAGGTAAC 60 

CATTAAAAGT TCACTGGACG AGGTCAAACT TTCCAAAGTT CCTGAAAAGA TTGTGAC CTT 120 

TGACCTCGGC GCTGCGGATA CTATTCGCGC TTTAGGATTT GAAAAAAATA TCGTCGGAAT 18 0 

GCCTACAAAA ACTGTTCCGA CTTATCTAAA AGACCTAGTG GGAACTGTCA AAAATGTTGG 240 

TTC T ATG AAA GAACCTGATT TAGAAGCTAT CGCCGCCCTT GAGC CTGATT TGATTATCGC 3 00 

TTCGCCACGT ACACAAAAAT TCGTAGACAA ATTCAAAGAA ATCGCCCCAA CCGTTCTCTT 3 60 

CCAAGCAAGC AAGGACGACT ACTGGACTTC TACCAAGGCT AATATCGAAT CCTTAGCAAG 42 0 

TGCCTTCGGC GAAAC TGGTA CACAGAAAGC CAAGGAAGAA TTGACCAAGC TAGACAAGAG 480 

CATC C AAGAA GTCGCTACTA AAAATGAAAG CTCTGACAAA AAAGCCCTTG CGATCCTCCT 54 0 

TAATGAAGGA AAAATGGCAG CCTTTGGTGC CAAATCTCGT TTCTCTTTCT TGTACCAAAC 600 

CTTGAAATTC AAACCAACTG ATACAAAATT TGAAGACTCA CGCCACGGAC AAGAAGTCAG 660 

CTTTGAAAGT GTCAAAGAAA' TCAACCCTGA CATCCTCTTT GTCATCAACC GTACCCTTGC 72 0 

CATCGGTGGG GACAACTCTA GCAACGACGG TGTCCTAGAA AATGCCCTTA TCGCTGAAAC .780 

ACCTGCTGCT AAAAATGGTA AGATTATCCA ACTAACACCA GACCTCTGGT ATCTAAGCGG 840 
AGGCGGACTT GAATCAACAA AACTCATGAT TGAAGACATA CAAAAAGCTT TGAAA ~ . 895 
(2) INFORMATION FOR SEQ ID NO: 24: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids 

(B) — TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ser Thr Asn Ser Ser Thr Ser Gin Thr Glu Thr Ser Ser Ser Ala Pro 
1 5 10 15 

Thr Glu Val Thr lie Lys Ser Ser Leu Asp Glu Val Lys Leu Ser Lys 
20 .25 30 

Val Pro Glu Lys lie Val Thr Phe Asp Leu Gly Ala Ala Asp Thr lie 
35 ' 40 45 

Arg Ala Leu Gly Phe Glu Lys Asn lie Val Gly Met Pro Thr Lys Thr 
50 55 60 

Val Pro Thr Tyr Leu Lys Asp Leu Val Gly Thr Val Lys Asn Val Gly 
65 .70 75 80 

Ser Met Lys. Glu Pro Asp Leu Glu Ala lie Ala Ala Leu Glu Pro Asp 
85 90 . 95 

Leu lie lie Ala Ser Pro Arg Thr Gin Lys Phe Val Asp Lys Phe Lys 
100 105 110 

Glu lie Ala Pro Thr Val Leu Phe Gin Ala Ser Lys Asp Asp Tyr Trp 
115 120 125 

Thr Ser Thr Lys Ala Asn He Glu Ser Leu Ala Ser Ala Phe Gly Glu 
130 135 140 

Thr Gly Thr Gin Lys Ala Lys Glu Glu Leu Thr Lys Leu Asp Lys Ser 
145 150 155 . 160 

He Gin Glu Val Ala Thr Lys Asn Glu Ser Ser Asp Lys Lys Ala Leu 
165 170 , 175 

Ala He Leu Leu Asn Glu Gly Lys Met Ala Ala Phe Gly Ala Lys Ser 
180 185 190 

Arg Phe Ser Phe- Leu Tyr Gin Thr Leu Lys Phe Lys Pro Thr Asp Thr 
195 200 205 

Lys Phe Glu Asp Ser Arg His Gly Gin Glu Val Ser Phe Glu Ser Val 

2X0:- 215 • 22 0 

Lys Glu He Asn Pro Asp He Leu Phe Val He Asn Arg Thr Leu Ala 
225 _ 230 235 240 

He Gly__Gly Asp Asn Ser Ser Asn Asp Gly Val Leu Glu Asn Ala Leu 
245 250 255 



He Ala. Glu Thr -Pro Ala Ala Lys Asn Gly Lys lie He Gin Leu Thr 
260 265 270 
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Pro 'Asp Leu Trp Tyr Leu Ser Gly Gly Gly Leu Glu Ser Thr Lys Leu 

275 -■- 280 285 

Met lie Glu Asp lie Gin Lys Ala Leu Lys ._ 

290 295 

(2) INFORMATION FOR SEQ' ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 61 . base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID. NO : 25 : 
TGGCAATTCT GGCGGAAGTA AAGATGCTGC CAAATCAGGT GGTGACGGTG CCAAAACAGA 
AATCACTTGG TGGGCATTCC CAGTATTTAC CCAAGAAAAA ACTGGTGACG" GTGTTGGAAC - ~ 
TTATGAAAAA TCAATCATCG AAGC GTTTGA AAAAGCAAAC CCAGATATAA AAGTGAAATT 
GGAAAC CATC GACTTCAAGT CAGGTCCTGA AAAAATCACA ACAGCCATCG AAGCAGGAAC 
AGCTCCAGAC GTACTCTTTG ATGCACCAGG ACGTATCATC CAATACGGTA AAAACGGTAA 
ATTGGC TGAG TTGAATGACC TCTTCACAGA TGAATTTGTT AAAGATGTCA ACAATGAAAA 
CATCGTACAA GCAAGTAAAG CTGGAGACAA GGCTTATATG TATCCGATTA GTTCTGCCCC 
ATTCTACATG GCAATGAACA AGAAAATGTT AGAAGATGCT GGAGTAGCAA AC CTTGTAAA 
AGAAGGTTGG ACAACTGATG ATTTTGAAAA AGTATTGAAA GCACTTAAAG ACAAGGGTTA 
CACACCAGGT TCATTGTTCA GTTCTGGTCA AGGGGGAGAC CAAGGAACAC GTGCCTTTAT 
CTCTAACCTT TATAGCGGTT CTGTAACAGA TGAAAAAGTT AGCAAATATA CAACTGATGA 
TCCTAAATTC GTCAAAGGTC TTGAAAAAGC AACTAGCTGG ATTAAAGACA ATTTGATCAA 
TAATGGTTCA C AATTTG AC G GTGGGGCAGA TATCCAAAAC TTTGCCAACG GTCAAACATC 
TTACACAATC CTTTGGGCAC CAGCTCAAAA TGGTATCCAA GCTAAACTTT TAGAAGCAAG 
TAAGGTAGAA GTGGT AG AAG TACCATTCCC ATCAGACGAA GGTAAGCCAG CTCTTGAGTA 
CCTTGTAAAC GGGTTTGCAG TATTCAACAA TAAAGACGAC AAGAAAGTCG CTGCATCTAA 
GAAATTCATC CAGTTTATCG CAGATGACAA GGAGTGGGGA CCTAAAGACG TAGTTCGTAC 
AGGTGCTTTC CCAGTCCGTA CTTCATTTGG AAAACTTTAT GAAGACAAAC GCATGGAAAC 
AATCAGCGGC TGGACTCAAT ACTACTCACC ATACTACAAC ACTATTGATG GATTTGCTGA 
AATGAGAACA CTTTGGTTCC CAATGTTGCA ATC TGTATC A AATGGTGACG AAAAACCAGC 
AGATGCTTTG AAAGCCTTCA CTGAAAAAGC GAACGAAACA ATCAAAAAAG CTATGAAACA 
A . ". ' ■ 



(2) INFORMATION FOR SEQ ID NO: 26: 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 6 : 

Gly Asn Ser Gly Gly. Ser Lys Asp Ala Ala Lys Ser Gly Gly Asp Gly 
15 10 15 

Ala Lys Thr Glu lie Thr Trp Trp Ala Phe Pro Val Phe Thr Gin Glu 
20 25 30 

Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu Lys Ser lie lie Glu Ala 
35 40 45 

Phe Glu Lys Ala Asn Pro Asp lie Lys Val Lys Leu Glu Thr lie Asp 
50 55 60 

Phe Lys Ser Gly Pro Glu Lys lie Thr Thr Ala lie Glu Ala Gly Thr 
65 70 75 80 

Ala Pro Asp Val Leu Phe Asp Ala Pro Gly Arg lie lie Gin Tyr Gly 
85 90 95 

Lys Asn Gly Lys Leu Ala Glu Leu Asn Asp Leu Phe Thr Asp Glu Phe 
100 105 110 

Val Lys Asp Val Asn Asn Glu Asn lie Val Gin Ala Ser Lys Ala Gly 
115 120 -125 

Asp Lys Ala Tyr Met Tyr Pro lie Ser Ser Ala Pro Phe Tyr Met Ala 
130 135 140 

Met Asn Lys Lys Met Leu Glu Asp Ala Gly Val Ala Asn Leu Val Lys 
145 . 150 155 160 

Glu Gly Trp Thr Thr Asp Asp Phe Glu Lys -Val Leu Lys Ala Leu Lys 
165 170 175 

Asp Lys Gly Tyr Thr Pro Gly Ser Leu Phe Ser Ser Gly Gin Gly Gly 
180 185 190 

Asp Gin Gly Thr Arg Ala Phe lie Ser Asn Leu Tyr Ser Gly Ser Val 
195 200 205 

Thr Asp Glu Lys Val Ser Lys Tyr Thr Thr Asp Asp Pro Lys Phe Val 
210 215 220 

Lys Gly Leu Glu Lys Ala Thr Ser Trp lie Lys Asp Asn Leu lie Asn 

.225 ... 230 ^ 235 240 

Asn Gly Ser Gin Phe Asp Gly Gly Ala Asp lie Gin Asn Phe Ala Asn 
245 . 250 255 

Gly Gin Thr Ser Tyr Thr lie .Leu Trp Ala Pro Ala Gin Asn Gly lie 
260" ' . . 265 ~" "" 270 
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Gin Ala Lys Leu Leu Glu Ala Ser Lys Val Glu Val Val Glu Val Pro 
275 280 285 

Phe Pro Ser Asp Glu Gly Lys Pro Ala Leu Glu Tyr Leu Val Asn Gly 

290 295 _ 300 

Phe Ala Val Phe Asn Asn Lys Asp Asp Lys Lys Val Ala Ala Ser Lys 
305 310 315 320 

Lys Phe lie Gin. Phe lie Ala Asp Asp Lys Glu Trp Gly Pro Lys Asp 
325 330 335 

Val Val Arg Thr Gly Ala Phe Pro Val Arg Thr Ser Phe Gly Lys Leu 
340 345 350 

Tyr Glu Asp Lys Arg Met Glu Thr lie Ser Gly Trp Thr Gin Tyr Tyr 
, 355 360 365 

Ser Pro Tyr Tyr Asn Thr lie Asp Gly Phe Ala Glu Met Arg Thr Leu 
370 375 380 



Trp Phe Pro Met Leu Gin Ser Val Ser Asn Gly Asp Glu Lys Pro Ala 
385 390 395 400 

Asp Ala Leu Lys Ala Phe Thr Glu Lys Ala Asn Glu Thr lie Lys Lys 
405 410 415 

Ala Met Lys Gin 
420 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs \ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TTCACAAGAA AAAACAAAAA ATGAAGATGG AGAAACTAAG ACAGAACAGA CAGCCAAAGC 60 

TGATGGAACA GTCGGTAGTA AGTCTCAAGG AGCTGCCCAG AAGAAAGCAG AAGTGGTCAA 120 

TAAAGGTGAT TACTACAGCA TTCAAGGGAA ATACGATGAA ATCATCGTAG CCAACAAACA 180 

CTATCCATTG TCTAAAGACT ATAATCCAGG GGAAAATCCA ACAGCCAAGG CAGAGTTGGT 240 

CAAACTCATC AAAGCGATGC AAGAGGC AGG TTTCCCTATT AGTGATCATT ACAGTGGTTT 3 00 

TAGAAGTTAT GAAACTCAGA CCAAGCTCTA TCAAGATTAT GTCAACCAAG ATGGAAAGGC 3 60 

AGCAGCTGAC CGTTACTCTG CCCGTCCTGG CTATAGCGAA CACCAGACAG GCTTGGCCTT 420 

TGATGTGATT GGGACTGATG GTGATTTGGT GACAGAAGAA AAAGCAGCCC AATGGCTCTT 48 0 

GGATCATGCA GCTGATTATG GCTTTGTTGT CCGTTATCTC AAAGGCAAGG AAAAGGAAAC 540 

AGGCTATATG GCTGAAGAAT GGCACCTGCG TTATGTAGGA AAAGAAGCTA AAGAAATTGC_ 600- 

TGGAAGTGGT CTCAGTTTGG AAGAATACTA TGGCTTTGAA GGCGGAGACT ACGTCGAT 658 
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(2) INFORMATION FOR SEQ ID NO: 28: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid \ 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



Ser Gin Glu Lys Thr Lys Asn Glu Asp Gly Glu Thr Lys Thr Glu Gin 
1 .5 10 15 

Thr Ala Lys Ala Asp Gly Thr Val Gly Ser Lys Ser Gin Gly Ala Ala 
20 25 30 

Gin. Lys Lys Ala Glu Val Val Asn Lys Gly Asp Tyr Tyr Ser lie Gin 
35 40 45 

Gly Lys Tyr Asp Glu lie lie Val Ala Asn Lys His Tyr Pro Leu Ser 
50 55 60 

Lys Asp Tyr Asn Pro Gly Glu Asn Pro Thr Ala Lys Ala Glu Leu Val 
65 70 75 80 

Lys Leu lie Lys Ala Met Gin Glu Ala Gly Phe Pro He Ser Asp His 
85 90 95 

Tyr Ser Gly Phe Arg Ser Tyr Glu Thr Gin Thr Lys Leu Tyr Gin Asp 
IU 100 105 110" 

;U . Tyr Val Asn Gin Asp Gly Lys Ala Ala . Ala Asp Arg Tyr Ser Ala Arg 

115 120 125 

Pro Gly Tyr Ser Glu His Gin Thr Gly Leu Ala Phe Asp Val He Gly 
130 135 140 

Thr Asp Gly Asp Leu Val Thr Glu Glu Lys Ala Ala Gin Trp Leu Leu 
145 150 155 160 

Asp His Ala Ala Asp Tyr Gly Phe Val Val Arg Tyr Leu Lys Gly Lys 
165 170 175 

Glu Lys Glu Thr Gly Tyr Met Ala Glu Glu Trp His Leu Arg Tyr Val 
180 185 190 

Gly Lys Glu Ala Lys Glu He Ala Ala Ser Gly Leu Ser Leu Glu Glu 
195 200 205 

Tyr Tyr Gly Phe Glu Gly Gly Asp Tyr Val Asp 
210 215 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 790 base pairs 

(B) TYPE nucleic acid 

"(C) STRANDEDNESS: "double 

<D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GAAAGGTCTG TGGTCAAATA ATCTTACCTG CGGTTATGAT GAAAAAATAA TCTTGGAAAA 60 

TATAAATATA AAAAT AC CTG AAGAAAAAAT ATCAGTTATT ATTGGGTCAA ATGGTTGTGG 120 

GAAATCAACA CTCATTAAAA CCTTGTCTCG ACTTATAAAG CCATTAGAGG GAGAAGTATT 180 

GCTTGATAAT AAATCAATTA ATTCTTATAA AGAAAAAGAT TTAGCAAAAC . ACATAGCTAT 240 

ATTACCTCAA TCTCCAATAA TCCCTGAATC . AATAAC AGTA GCTGATCTTG TAAGCCGTGG 300 

TCGTTTCCCC TACAGAAAGC CTTTTAAGAG TCTTGGAAAA GATGACCTTG AAATAATAAA 3 60 

CAGATCAATG GTTAAGGC C A ATGTTGAAGA TCTAGCAAAT AACCTAGTTG AAGAAC TTTC 420 

Q TGGGGGTCAA AGGCAAAGAG TATGGATAGC TCTAGCCCTA GCCCAAGATA CAAGTATCCT 480 

>C\ ACTTTTAGAT GAGCCAACTA CTTACTTGGA TATCTCATAT CAAATAGAAC T ATT AG AC C T 540 

^ CTTGACTGAT CTAAACCAAA AATATAAGAC AACCATTTGC ATGATTTTGC ACGATATAAA 600 

|"U TCTAACAGCA AGATACGCTG ATTACCTATT TGCAATTAAA GAAGGTAAAC TTGTTGCAGA 660 

H GGGAAAGCCT GAAGATATAC TAAATGATAA ACTAGTTAAA GATATCTTTA ATCTTGAAGC 720 

p AAAAATTATA CGTGACCCTA TTTCCAATTC GCCTCTAATG ATTC CTATTG GCAAGCACCA 780 

ill , - 

p t TGTTAACTCT ^ ~" 790 

^ (2) INFORMATION FOR SEQ ID NO : 3 0 : 

M; (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 
.(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Lys Gly Leu Trp Ser Asn Asn Leu Thr Cys Gly Tyr Asp Glu Lys lie 
1 5 10 15 

lie Leu Glu Asn lie Asn lie Lys lie Pro Glu Glu Lys lie Ser Val 
20 25 30 

lie lie Gly Ser Asn Gly Cys Gly Lys Ser Thr Leu lie Lys Thr Leu 
35 40 45 

Ser Arg Leu. lie Lys Pro Leu Glu Gly Glu Val Leu Leu Asp Asn Lys 
50 55 60 

■Ser lie Asn Ser Tyr Lys Glu Lys Asp Leu Ala Lys His lie Ala lie 

._JSS 70 " 75 80 



Leu Pro Gin Ser Pro lie lie Pro Glu Ser" lie Thr Val Ala Asp Leu 
85 .90 95 



• 
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— Val Ser Arg Gly Arg Phe Pro Tyr Arg Lys Pro Phe. Lys Ser Leu Gly 
100 105 110 



Lys Asp Asp Leu Glu lie. lie Asn Arg Ser Met Val Lys Ala Asn Val 
115 ' - . 120 125 

Glu Asp Leu Ala" Asn Asn Leu Val Glu Glu Leu Ser Gly Gly Gin Arg 
130 135 140 

Gin Arg Val Trp lie Ala Leu Ala Leu Ala Gin Asp Thr Ser He Leu 
145 150 155 160 

Leu Leu Asp Glu Pro Thr Thr Tyr Leu Asp He Ser Tyr Gin He Glu 
165 170 , ■ 175 

Leu' Leu Asp Leu Leu Thr Asp Leu Asn Gin Lys Tyr Lys Thr Thr He 
180 185 190 

Cys Met He Leu His Asp He Asn Leu Thr Ala Arg Tyr Ala Asp Tyr 

195 ™— —2 00 " " 205 

Leu Phe Ala He Lys Glu Gly Lys Leu Val Ala Glu Gly Lys Pro Glu 
210 215 220 

Asp He Leu Asn Asp Lys Leu Val Lys Asp He Phe Asn Leu Glu Ala 
225 230 235 240 

Lys He He Arg Asp Pro He Ser Asn Ser Pro Leu Met He Pro He 
245 250 255 

Gly Lys His His Val Ser 

_ 260 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base. pairs * 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



AAACTCAGAA 


AAGAAAGCAG 


ACAATGCAAC 


AACTATCAAA 


ATCGCAACTG- 


TTAACCGTAG 


60 


CGGTTCTGAA 


GAAAAACGTT 


GGGACAAAAT 


CCAAGAATTG 


GTTAAAAAAG 


ACGGAATTAC 


120 


CTTGGAATTT 


ACAGAGTTCA 


CAGACTACTC 


ACAACCAAAC 


AAAGCAACTG 


CTGATGGCGA 


180 


AGTAGATTTG 


AACGCTTTCC 


AAC AC TATAA 


CTTCTTGAAC 


AACTGGAACA 


AAGAAAAC GG 


240 


AAAAGAC CTT 


GTAGC GATTG 


CAGATACTTA 


CATCTCTCCA 


ATCCGCCTTT 


ACTCAGGTTT 


300 


GAATGGAAGT 


GCCAACAAGT 


ACACTAAAGT 


AGAAGACATC 


CCAGCAAACG 


GAGAAATCGC 


360 


TGTACCGAAT 


GACGCTACAA 


ACGAAAGCCG 


TGC GCTTTAT 


TTGCTTCAAT 


CAGCTGGCTT 


420 


GATTAAATTG 


GATGTTTCTG 


GAACTGCTCT 


TGCAACAGTT 


GCCAACATCA 


AAGAAAATCC 


480 


AAAGAACTTG 


AAAATCACTG 


AATTGGACGC 


TAGCCAAACA 


GCTCGTTCAT 


TGTCATCAGT 


540 
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TGACGCTGCC GTTGTAAACA ATACCTTCGT TACAGAAGCA AAATTGGACT ACAAGAAATC 60 6 

ACTTTTCAAA GAACAAGCTG ATGAAAACTC AAAACAATGG TACAACATCA TTGTTGCAAA 660 

AAAAGATTGG GAAACATCAC CTAAGGCTGA TGCTATCAAG AAAGTAATCG CAGCTTACCA 72 0 

CACAGATGAC GTGAAAAAAG TTATCGAAGA ATCATCAGAT GGTTTGGATC AACCAGTTTG 780 
G - 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - '" ~" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Asn Ser Glu Lys Lys Ala Asp Asn Ala Thr Thr He Lys He Ala Thr 
1 5 10 15 

Val Asn Arg Ser Gly Ser Glu Glu Lys Arg Trp Asp Lys He Gin Glu 
20 25 30 

Leu Val Lys Lys Asp Gly He Thr Leu Glu Phe Thr Glu Phe Thr Asp 
35 40 45 

Tyr Ser Gin Pro "Asn Lys Ala Thr Ala Asp Gly Glu Val Asp Leu Asn 
50 55 60 

Ala Phe Gin His Tyr Asn Phe Leu Asn Asn Trp Asn Lys Glu Asn Gly 
65 70 75 80 

Lys Asp Leu Val Ala He Ala Asp Thr Tyr He Ser Pro He Arg Leu 
85 90 95 

Tyr Ser Gly Leu Asn Gly Ser Ala Asn Lys Tyr Thr Lys Val Glu Asp 
100 105 HO 

He Pro Ala Asn Gly Glu- He Ala Val Pro Asn Asp Ala Thr Asn Glu 
115 120 125 

Ser Arg Ala Leu Tyr Leu Leu Gin Ser Ala Gly Leu He Lys Leu Asp 
130 135 140 

Val Ser Gly Thr Ala Leu Ala Thr Val Ala .Asn He Lys Glu Asn Pro 
145 150 155 160 

Lys Asn Leu Lys He Thr Glu Leu Asp Ala Ser Gin Thr Ala Arg Ser 
165 "170 175 

Leu Ser Ser Val Asp Ala Ala Val Val Asn Asn Thr Phe^Val Thr Glu 
180 185 S 190 . 

Ala Lys Leu Asp Tyr Lys Lys Ser Leu Phe-Lys Glu~Gln Ala Asp_Glu. 
195 200. .. " - ' 205 
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Asn Ser Lys Gin Trp Tyr Asn lie lie Val Ala Lys Lys Asp Trp Glu 
210 215 220 

Thr Ser Pro Lys Ala Asp Ala lie Lys Lys Val lie Ala Ala Tyr His 
225 230 235 240 

Thr Asp Asp Val Lys Lys Val lie Glu Glu Ser Ser Asp Gly Leu Asp 
245 250 255 

Gin Pro Val Trp 
260 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: ■ 

(A) LENGTH : 640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TTCGAAAGGG TCAGAAGGTG CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA 
TCAATTTTAT GAGCAAGTGA AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC 
CATCCAAAAA GTTTTTGAAA AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATGA 
TAC TATTGCC GAAGAAAAAA AACAATATGG CGAAAACTAC CAACGTGTCT TGTCACAAGC 
AGGTATGACT CTTGAAACAC GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC 
AGTTAAGAAG GTAGCAGAAG CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATGA 
GTACACTCCA GATGTAACGG CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA 
AGTTCTC GAA AAAGC C AAGG CAGAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC 
AACTGATGAA AAAACAAAAG AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA 
AGTAC CTGGA GCAAGTCCAA AAAAGCCGCT TTTCGC TTTT AGATGTGGGA TGGTGTTTCT 
GGATGTGGAT TACAGCAACT GGGGCACACC AAGCCTACAG 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Ser Lys Gly Ser Glu Gly Ala Asp Leu lie Ser Met Lys Gly Asp Val 

1 - . 5 10 _ 15 

lie Thr Glu His Gin Phe Tyr Glu Gin Val Lys Ser Asn Pro Ser Ala 
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20 25 .30 

Gin Gin Val Leu Leu Asn Met Thr lie Gin Lys Val Phe Glu Lys Gin 

35 - - . 40 45 

Tyr Gly Ser Glu Leu Asp Asp Lys Glu Val Asp Asp Thr lie Ala Glu 
50 55 60 

Glu Lys Lys Gin Tyr Gly Glu Asn Tyr Gin Arg Val Leu Ser Gin Ala 
65 70 75 80 

Gly Met Thr Leu Glu Thr Arg Lys Ala Gin lie Arg Thr Ser Lys Leu 
85 90 95 

Val Glu Leu Ala Val Lys Lys Val Ala Glu Ala Glu Leu Thr Asp Glu 
100 105 110 

Ala Tyr Lys Lys Ala Phe Asp Glii Tyr Thr Pro Asp Val Thr Ala Gin 
115 120 ' 125 

lie lie Arg Leu Asn Asn Glu Asp Lys Ala Lys Glu Val Leu Glu Lys 
130 135 140 

Ala Lys Ala Glu Gly Ala Asp Phe Ala Gin Leu Ala Lys Asp Asn Ser 
145 150 155 160 

Thr Asp Glu Lys Thr Lys Glu Asn Gly Gly Glu lie Thr Phe Asp Ser 
165 170 175 

Ala Ser Thr Glu Val Pro Gly Ala Ser Pro Lys Lys Pro Leu Phe Ala 
180 185 190 

Phe Arg Cys Gly Met Val Phe Leu Asp Val Asp Tyr Ser Asn Trp Gly 
195 200 205 

Thr Pro Ser Leu Gin 
210 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GGGGATGGCA GCTTTTAAAA ATC CTAACAA TCAATACAAA GCTATTACAA TTGCTCAAAC 
TCTAGGTGAT GATGC TTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG CTGTTCAGTG 
TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG TTGTAGAAAA 
ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG . AATCTAATGG 
TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA CAGATGGTAA 



ATGGT AC TAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT TTTCTGGTAG 
CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG ATGGTAGCAG 
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ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG AAAATGGCAC 
TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG TCGGACCACA 
C TGGTAC TAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC CAGATGGTTA 
CCGTGTAAAT GGTAATGGTG AATGGGTAAA C 
(2) INFORMATION FOR SEQ ID NO: 36:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) . STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: - ----- - - 

Gly Met Ala Ala Phe Lys Asn .Pro Asn Asn Gin Tyr Lys Ala lie Thr 
15 10 15 

lie Ala Gin Thr Leu Gly Asp Asp Ala Ser Ser Glu Glu Leu Ala Gly 
20 25 30 

Arg Tyr Gly Ser Ala Val Gin Cys Thr Glu Val Thr Ala Ser Asn Leu 
35 40 45 

Ser Thr Val Lys Thr Lys Ala Thr Val Val Glu Lys Pro Leu Lys Asp 
50 , ' 55 60 

Phe Arg Ala Ser Thr Ser Asp Gin Ser Gly TrpVal Glu Ser Asn Gly 
65 70 75 80 

Lys Trp Tyr Phe Tyr Glu Ser Gly Asp Val Lys Thr Gly Trp Val Lys 
85 90 95 

Thr Asp Gly Lys Trp Tyr Tyr Leu Asn Asp Leu Gly Val Met Gin Thr 
100 105 . HO 

Gly Phe Val Lys Phe Ser Gly Ser Trp Tyr Tyr Leu Ser Asn Ser Gly 
115 120 125 

Ala Met Phe Thr Gly Trp Gly Thr Asp Gly Ser Arg Trp Phe Tyr Phe 
130 135 140 

Asp Gly Ser Gly Ala Met Lys Thr Gly Trp Tyr Lys Glu Asn Gly Thr 
145 150 155 , .160 

Trp Tyr Tyr Leu Asp Glu Ala Gly lie Met Lys Thr Gly Trp Phe Lys 
165 170 175 

Val Gly Pro His Trp Tyr Tyr Ala Tyr Gly Ser Gly Ala Leu Ala Val 
180 185 190 

Ser Thr Thr Thr Pro Asp Gly Tyr Arg Val Asn Gly Asn Gly Glu Trp 

195 200 205 — 



Val Asn 
210 



148 



(2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



AGACGAGCAA 


AAAATTAAGC 


AAGCAGAAGC 


GGAAGTTGAG 


AGTAAACAAG 


CTGAGGCTAC 


60 


AAGGTTAAAA 


AAAATCAAGA 


CAGATCGTGA 


AGAAGCAGAA 


GAAGAAGCTA 


AACGAAGAGC 


120 


AG ATGC T AAA 


GAGCAAGGTA 


AACCAAAGGG 


GCGGGCAAAA 


CGAGGAGTTC 


CTGGAGAGCT 


180 


AGCAACACCT 


GATAAAAAAG 


AAAATGATGC 


GAAGTCTTCA 


GATTCTAGCG 


TAGGTGAAGA 


240 


AACTCTTCCA 


AGCCCATCCC 


TGAAACCAGA 


AAAAAAGGTA 


GCAGAAGCTG 


AGAAGAAGGT 


300 


TGAAGAAGCT 


AAGAAAAAAG 


CCGAGGATCA 


AAAAGAAGAA 


GATCGCCGTA 


ACTACCCAAC 


360 


CAATACTTAC 


AAAACGCTTG 


AAC TTGAAAT 


TGCTGAGTCC 


GATGTGGAAG 


TTAAAAAAGC 


420 


GGAGCTTGAA 


CTAGTAAAAG 


AGGAAGCTAA 


GGAACCTCGA 


AACGAGGAAA 


AAGTTAAGCA 


480 


AGCAAAAGCG 


GAAGTTGAGA 


GTAAAAAAGC 


TGAGGC T AC A 


AGGTTAGAAA 


AAATCAAGAC 


540 


AGATC GT AAA 


AAAGCAGAAG 


AAGAAGCTAA 


ACGAAAAGCA 


GCAGAAGAAG 


ATAAAGTTAA, 


600. 


AGAAAAACCA 


GCTGAACAAC 


CACAACCAGC 


GCCGGCTCCA 


AAAGCAGAAA 


AACCAGCTCC 


~ 660 


AGCTCCAAAA 


CCAGAGAATC 


CAGCTGAACA 


ACCAAAAGCA 


GAAAAACCAG 


CTGATCAACA 


720 


nut 1 vjrvrtAjrirt. 


APT ATGCTC 


GTAGATCAGA 


AGAAGAATAT 


AATCGCTTGA 


CTCAACAGCA 


780 


ACCGCCAAAA 


ACTGAAAAAC 


CAGCACAACC 


ATCTACTCCA 


AAAACAGGCT 


GGAAACAAGA 


840 


AAACGGTATG 


TGGTACTTCT 


ACAATACTGA 


TGGTTCAATG 


GCGACAGGAT 


GGCTCCAAAA 


900 


CAATGGCTCA 


TGGTACTACC 


TCAACAGCAA TGGCGCTATG 


GCGACAGGAT 


GGCTCCAAAA 


960 


CAATGGTTCA 


TGGTACTATC 


TAAACGCTAA 


TGGTTCAATG 


GCAACAGGAT 


GGCTCCAAAA 


1020 


CAATGGTTCA 


TGGTACTACC 


TAAACGCTAA 


TGGTTCAATG 


GCGACAGGAT 


GGCTCCAATA 


1080 


CAATGGCTCA 


TGGTACTACC 


TAAACGCTAA 


TGGTTCAATG 


GCGACAGGAT 


GGCTCCAATA 


1140 


CAATGGCTCA 


TGGTACTACC 


TAAACGCTAA 


TGGTGATATG 


GCGACAGGTT 


GGGTGAAAGA 


1200 


TGGAGATACC 


TGGTACTATC 


TTGAAGCATC 


AGGTGCTATG 


AAAGCAAGCC 


AATGGTTCAA 


1260 


AGTATCAGAT 


AAATGGTACT 


ATGTCAATGG 


CTCAGGTGCC 


CTTGCAGTCA 


. AC AC AAC TGT 


1320 


AGATGGC TAT 


GGAGTCAATG 


CCAATGGTGA 


, ATGGGTAAAC 






1360 



(2) INFORMATION FOR SEQ ID NO: 38: 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 453 amino acids 
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(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID- NO: 38: 

Asp Glu Gin Lys lie Lys Gin Ala Glu Ala Glu Val Glu Ser Lys Gin 
1 5 10 - 15 

Ala Glu Ala Thr Arg Leu Lys Lys lie Lys Thr Asp Arg Glu Glu Ala 
20 - 25 30 

Glu Glu Glu Ala Lys Arg Arg Ala Asp Ala Lys Glu Gin Gly Lys Pro 
35 40 45 

Lys Gly Arg Ala Lys Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp . 
50 55 60 

Lys Lys Glu Asn Asp Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu 
65 70 75 80 

Thr Leu Pro Ser Pro Ser Leu Lys Pro Glu Lys Lys Val Ala Glu Ala 
85 90 95 

Glu Lys Lys Val Glu Glu Ala Lys Lys Lys Ala Glu Asp Gin Lys Glu 
100 105 110 

Glu Asp Arg Arg Asn. Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Glu Leu 
115 120 125 

Glu lie Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu Glu Leu 
130 135 140 

Val Lys Glu Glu Ala Lys Glu Pro Arg Asn Glu Glu Lys Val Lys Gin 
145 150 155 . 160 

Ala Lys Ala Glu Val Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu 
165 170 175 

Lys lie Lys Thr Asp Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys 
180 185 190 . 

Ala Ala Glu Glu Asp Lys Val Lys Glu Lys Pro Ala Glu Gin Pro Gin 
195 200 205 

Pro Ala Pro Ala Pro Lys Ala Glu Lys Pro Ala Pro Ala Pro Lys Pro 
210 215 220 

Glu Asn Pro Ala < Glu Gin Pro Lys Ala Glu Lys Pro Ala Asp Gin Gin 
225 230 235 240 

Ala Glu Glu Asp Tyr Ala Arg Arg Ser Glu Glu Glu Tyr Asn Arg Leu 

245 - 250 - " - 255 

Thr Gin Gin Gin Pro Pro Lys Thr Glu Lys Pro Ala Gin Pro Ser Thr 
260 265 270 



„Pro Lys Thr Gly Trp Lys Gin Glu Asn Gly Met Trp Tyr Phe Tyr Asn 
275 .280 285 
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Thr Asp Gly Ser Met Ala Thr Gly Trp Leu Gin Asn Asn Gly Ser Trp 
290 295 300 

Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala -Thr Gly Trp Leu Gin Asn 
305 310 315 320 

Asn Gly Ser Trp Tyr. Tyr Leu Asn Ala Asn Gly Ser Met Ala Thr Gly 
325 330 335 

Trp Leu Gin Asn Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ser 
340 345 350 

Met Ala Thr Gly Trp Leu Gin Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn 
355 ' 360 365 

Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gin Tyr Asn Gly Ser Trp 
370 375 380 

Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Val Lys Asp 
385 ' 390^ 395 400 

Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met .Lys Ala Ser 
405 410 ' 415 

Gin Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gly Ser Gly 
420 425 430 

Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Gly Val Asn Ala Asn 
435 440 445 

Gly Glu Trp Val Asn 
450 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CTGTGGTGAG GAAGAAACTA AAAAGACTCA AGCAGCACAA CAGCCAAAAC AACAAACGAC 
TGTACAACAA ATTGCTGTTG GAAAAGATGC TCCAGACTTC ACATTGCAAT CCATGGATGG 
CAAAGAAGTT AAGTTATCTG ATTTTAAGGG TAAAAAGGTT TACTTGAAGT TTTGGGCTTC 
ATGGTGTGGT CCATGCAAGA AAAGTATGCC AGAGTTGATG GAACTAGCGG CGAAACCAGA 
TCGTGATTTC GAAATTCTTA CTGTCATTGC ACCAGGAATT CAAGGTGAAA AAACTGTTGA 
GCAATTC CCA CAATGGTTCC AGGAACAAGG ATATAAGGAT ATCCCAGTTC TTTATGATAC 
CAAAGCAACC ACTTCCAAGC TTATCAAATT CGAAGCATTC CTACAGAATA TT 
(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 



(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii). MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Cys Gly Glu Glu Glu Thr Lys Lys Thr Gin Ala Ala Gin Gin Pro Lys 
15 10 15 

Gin Gin Thr Thr Val Gin Gin lie Ala Val Gly Lys Asp Ala Pro Asp 
20 25 30 

Phe Thr Leu Gin Ser Met Asp Gly Lys Glu Val Lys Leu Ser Asp Phe 
35 40 45 

Lys Gly Lys Lys Val Tyr Leu Lys Phe Trp Ala Ser Trp Cys Gly Pro 
50 55 60 

Cys Lys Lys Ser Met Pro Glu Leu Met Glu Leu Ala Ala Lys Pro Asp 
65 70 75 80 

Arg Asp Phe Glu lie Leu Thr Val He Ala Pro Gly He Gin Gly Glu 
85 90 95 

Lys Thr Val Glu Gin Phe Pro Gin Trp Phe Gin Glu Gin Gly Tyr Lys 
100 105 110 

Asp He Pro Val Leu Tyr Asp Thr Lys Ala Thr Thr Ser Lys Leu He 
115 120 125 



Lys Phe Glu Ala Phe Leu Gin Asn He 
130 135 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GACTTTTAAC AATAAAACTA TTGAAGAGTT GCACAATCTC CTTGTCTCTA AGGAAATTTC 
TGCAACAGAA TTGAC CCAAG CAACACTTGA AAATATCAAG TCTCGTGAGG AAGCCCTCAA 
TTCATTTGTC ACCATCGCTG AGGAGCAAGC TCTTGTTCAA GCTAAAGCCA TTGATGAAGC 
tGGAATTGAT GCTGACAATG TCCTTTCAGG AATTCCACTT GCTGTTAAGG ATAACATCTC 
TACAGACGGT ATTCTCACAA CTGCTGCCTC AAAAATGCTC TACAACTATG AGCCAATCTT 
TGATGCGAC a gCTgTTGCCA ATGCAAAAAC CAAGGGCATG ATTGTCGTTG GAAAGACCAA 
CATGGACGAA TTTGCTATGG GTGGTTCAGG t GAAAC TTC A CACTACGGAG CAACTAAAAA 



CGCTTGGAAC CACAGCAAGG TTCCTGGTGG GTCATCAAGT GGTTCTGCCG CAGCTGTAGC 



H r. 5 
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CTCAGGACAA GTTCGCTTGT CACTTGGTTC TGATAC TGGT GGTTCCATCC GCCAACCTGC 540 

TGCCTTCAAC GGAATCGTTG GTCTCAAACC AACCTACGGA ACAGTTTCAC GTTTCGGTCT 600 

CATTGCCTTT GGTAGCTCAT TAGACCAGAT TGGACCTTTT GCTCCTACTG TTAAGGAAAA 660 

TGCCCTCTTG CTCAACGCTA TTGCCAGCGA AGATGCTAAA GACTCTACTT CTGCTCCTGT 720 

CCGCATCGCC GACTTTACTT CAAAAATCGG CCAAGACATC AAGGGTATGA AAATCGCTTT 780 

GCCTAAGGAA TACCTAGGCG AAGGAATTGA TCCAGAGGTT AAGGAAACAA TCTTAAACGC 840 

GGCCAAACAC TTTGAAAAAT TGGGTGCTAT CGTCGAAGAA GTCAGCCTTC CTCACTCTAA 900 

ATAC GGTGTT GCCGTTTATT ACATCATCGC TTCATCAGAA GCTTCATCAA ACTTGCAACG 960 

CTTCGACGGT ATCCGTTACG GCTATCGCGC AGAAGATGCA ACCAACCTTG ATGAAATC T A 1020 

TGTAAACAGC CGAAGCCAAG GTTTTGGTGA AGAGGTAAAA CGTCGTATCA TGCTGGGTAC 1080 

TTTCAGTCTT TCATCAGGTT ACTATGATGC CTACTACAAA AAGGCTGGTC AAGTCCGTAC 1140 

lil CCTCATCATT CAAGATTTCG AAAAAGTCTT CGCGGATTAC GATTTGATTT TGGGTCCAAC 1200 

TGCTCCAAGT gttgcctatg ACTTGGATTC TCTCAACCAT GACCCAGTTG CCATGTACTT 1260 

1"~ AGCCGACCTA TTG AC CAT AC CTGTAAACTT GGCAGGACTG CCTGGAATTT CGATTCCTGC 132 0 

j 8 ? TGGATTC TCT CAAGGTCTAC CTGTCGGACT CCAATTGATT GGTCCCAAGT ACTCTGAGGA 13 80 

^ AACCATTTAC CAAGCTGCTG CTGCTTTTGA AGCAACAACA GACT AC C AC A AACAACAACC 1440 
CGTGATTTTT GGAGGTGACA AC .1462- 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 487 amino acids ■ . , ' 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: .protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Thr Phe Asn Asn Lys Thr lie Glu Glu Leu His Asn Leu Leu Val Ser 
1 5 10 15 

Lys Glu He Ser Ala Thr Glu Leu. Thr Gin Ala Thr Leu Glu Asn He 
20 25 30 

Lys Ser Arg Glu Glu Ala Leu Asn Ser Phe Val Thr lie Ala Glu Glu 
35 " 40 45 

Gin Ala Leu Val Gin Ala Lys Ala ile Asp Glu Ala Gly He Asp Ala 
50 55 60 
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Asp Asn Val Leu Ser Gly Ile Pro Leu Ala Val Lys Asp Asn Ile Ser 
65 70 75 80 
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Thr Asp Gly lie Leu Thr Thr Ala Ala Ser Lys Met Leu Tyr Asri Tyr 
85 90 95 

Glu Pro lie Phe Asp Ala Thr Ala Val Ala Asn Ala Lys Thr Lys Gly 
100 105 110 

Met lie Val Val Gly Lys Thr Asn Met Asp Glu Phe Ala Met Gly Gly 
115 120 125 

Ser Gly Glu Thr Ser His Tyr Gly Ala Thr Lys Asn Ala Trp Asn His 
130 .135 140" 

Ser Lys Val Pro Gly Gly Ser Ser Ser Gly Ser Ala Ala Ala Val Ala 
145 150 155 160 

Ser Gly Gin Val Arg Leu Ser Leu Gly Ser Asp Thr Gly Gly Ser ' lie 
165 170 . 175 

Arg Gin Pro Ala Ala Phe Asn Gly lie Val Gly Leu Lys Pro Thr Tyr 

180 185 -■- 190 



Gly Thr Val Ser Arg Phe Gly Leu lie Ala Phe Gly Ser Ser Leu Asp 
195 200 205 

Gin lie Gly Pro Phe Ala Pro Thr Val Lys Glu Asn Ala Leu Leu Leu 
210 215 220 

Asn Ala lie Ala Ser Glu Asp Ala Lys Asp Ser Thr Ser Ala Pro Val 
225 230 235 240 

Arg lie Ala Asp Phe Thr Ser Lys lie Gly Gin Asp He Lys Gly Met 
245 250 255 

Lys He Ala Leu Pro Lys Glu Tyr Leu Gly Glu Gly He Asp Pro Glu 
260 265 270 

Val Lys Glu Thr He Leu Asn Ala Ala Lys His Phe Glu Lys Leu Gly 
275 280 285 

Ala He Val Glu Glu Val Ser Leu Pro His Ser Lys Tyr Gly Val Ala 
290 295 300 

Val Tyr Tyr He He Ala Ser Ser Glu Ala Ser Ser Asn Leu Gin Arg 
305 310 315 320 

Phe Asp Gly He Arg Tyr Gly Tyr Arg Ala Glu Asp Ala Thr Asn Leu 
325 330 335 

Asp Glu lie Tyr Val Asn Ser Arg Ser Gin Gly Phe Gly Glu Glu Val 
340 345 - 350 

Lys Arg Arg He Met Leu Gly Thr Phe Ser Leu Ser Ser Gly Tyr Tyr 
355 360 — ■ 365 

Asp Ala Tyr Tyr Lys Lys Ala Gly Gin Val Arg Thr Leu He He Gin 
370 375 - 380 

Asp Phe Glu Lys Val Phe Ala Asp Tyr Asp Leu He Leu Gly Pro Thr 
385 390 395 400 



Ala Pro Ser Val Ala.Tyr^Asp Leu" Asp Ser Leu_.Asn His Asp Pro Val 
405 410 ... 415 
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Ala Met Tyr Leu Ala Asp Leu Leu Thr lie Pro Val Asn Leu Ala Gly 
' 420 425 430 

Leu Pro Gly lie Ser lie Pro Ala Gly Phe Ser Gin Gly Leu Pro. Val 
435 -440 445 

Gly Leu Gin Leu lie Gly Pro Lys Tyr Ser Glu Glu Thr lie Tyr Gin. 
450 455 460 

Ala Ala Ala Ala Phe Glu Ala Thr. Thr Asp Tyr His Lys Gin Gin Pro 
465 470 475 480 

Val lie Phe Gly Gly Asp Asn 
485 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 3 base pairs 
'(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CTTTACAGGT AAACAACTAC AAGTCGGC G A CAAGGC GCTT GATTTTTCTC TTACTACAAC 
AGATCTTTCT AAAAAATCTC TGGCTGATTT TGATGGCAAG AAAAAAGTCT TGAGTGTCGT 
TCCTTCTATC GATACAGGCA TCTGCTCAAC TCAAACACGT CGTTTTAATG AAGAATTGGC 
TGGACTGGAC AACACGGTCG TATTGACTGT TTCAATGGAC CTACCTTTTG CTCAAAAACG 
TTGGTGC GGT GCTGAAGGCC TTGACAATGC CATTATGCTT TCAGACTACT TTGACCATTC 
TTTCGGGCGC GATTATGCCC TCTTGATCAA CGAATGGCAC . CTATTAGCAC GCGCAGTCTT 
TGTCCTCGAT ACTGACAATA CGATTCGCTA C GTTGAAT AC GTGGATAATA . TC AATTCTG A 
GCCAAACTTC GAA 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear ■ ' 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Phe Thr Gly Lys Gin Leu Gin Val Gly Asp Lys Ala Leu Asp Phe Ser 
1 5 10 15 

Leu Thr Thr Thr Asp Leu Ser Lys Lys Ser Leu Ala Asp Phe Asp Gly 

_...2.0_. .25 3 0 

Lys Lys Lys Val Leu Ser Val Val Pro Ser lie Asp Thr Gly lie Cys 
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35 40 45 

Ser Thr Gin Thr Arg Arg Phe Asn Glu Glu Leu Ala Gly Leu Asp Asn 
50 . 55 60 

Thr Val Val Leu Thr Val Ser Met Asp Leu Pro Phe Ala Gin Lys Arg 
65 70 ' 75 80 

Trp Cys Gly Ala Glu Gly Leu Asp Asn Ala lie Met Leu Ser Asp Tyr 
85 90 95 

Phe Asp His Ser Phe Gly Arg Asp Tyr. Ala Leu Leu lie Asn Glu Trp 
100 105 110 

His Leu Leu Ala Arg Ala Val Phe Val Leu Asp Thr Asp Asn Thr lie 
115 120 125 

Arg Tyr Val Glu Tyr Val Asp Asn lie Asn Ser Glu Pro Asn Phe Glu 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 45:, 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

CCAGGCTGAT AC AAGTATC G CAGACATTCA AAAAAGAGGC GAAC TGGTTG TCGGTGTCAA ' 60 

ACAAGACGTT CCCAATTTTG GTTACAAnGA TCCCAAGACC GGTACTTATT CTGGTATCGA 12 0 

AaCCGACTTG GCCAAGATGG TAGCTGATGA ACTCAAGGTC AAGATTC GCT ATGTGCCGGT 180 

TACAGCACAA ACCCGCGGCC CCCTTCTAGA CAATGAACAG GTCGATATGG ATATCGC GAC 240 

CTTT AC CATC ACGGACGAAC GCAAAAAACT CTACAACTTT ACCAGTCCCT ACTACACAGA 300 

CGCTTCTGGA TTTTTGGTCA ATAAATCTGC CAAAATCAAA AAGATTGAGG ACCTAAACGG 3 60 

CAAAACCATC GGAGTCGCCC AAGGTTCTAT CACCCAACGC CTGATTACTG AACTGGGTAA 420 

AAAGAAAGGT CTGAAGTTTA AATTC GTCGA ACTTGGTTCC TACCCAGAAT TGATTACTTC 480 

CCTGCACGCT CATCGTATCG ATACCTTTTC CGTTGACCGC TCTATTCTAT C TGGC T AC AC 540 

TAGTAAACGG ACAGCACTAC TAGATGATAG TTTCAAGCCA TCTGACTACG GTATTGTTAG 600 

CAAGAAATCA AATACAGAGC TCAACGACTA TCTTGATAAC TTGGTTACTA AATGGAGCAA 660 

GGATGGTAGT TTGCAGAAAC TTTATGACCG TTACAAGCTC AAACCATCTA GCCATACTGC 720 

AGAT ... . "' . 724 
(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 
-(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein. 



(xi) SEQUENCE_DESCRIPTION: SEQ ID NO:46: 

Gin Ala Asp Thr Ser lie Ala Asp lie Gin Lys Arg Gly Glu Leu Val 
1 5 10 15 . 

Val Gly Val Lys Gin Asp Val Pro Asn Phe Gly Tyr Xaa Asp Pro Lys 

20 ■ 25 - 30 

Thr Gly Thr Tyr Ser Gly lie Glu Thr Asp Leu Ala Lys Met Val Ala 
35 40 45 

Asp Glu Leu Lys Val Lys lie Arg Tyr Val Pro Val Thr Ala Gin Thr 
50 55 60 

Arg Gly Pro Leu Leu Asp Asn~~Glu Gln~ Val Asp Met Asp lie Ala Thr 
65 70 75 80 

m 

!?? Phe Thr lie Thr Asp Glu Arg Lys Lys Leu Tyr Asn Phe Thr Ser Pro 

l^j 85 90 95 

Tyr Tyr Thr Asp Ala Ser Gly Phe Leu Val Asn Lys Ser Ala Lys lie 
^ 100 105 110 

O Lys Lys lie Glu Asp Leu Asn Gly Lys Thr lie Gly Val Ala Gin Gly 

W - 115 120 . 125 

Ser lie Thr Gin Arg Leu lie Thr Glu Leu Gly Lys Lys Lys Gly Leu 
130 135 140 



H Lys Phe Lys Phe Val Glu Leu Gly Ser Tyr Pro Glu Leu lie Thr Ser 

145 150 155 160 

Leu His Ala His Arg lie Asp Thr Phe Ser Val Asp Arg Ser lie Leu 
165 170 175 

Ser Gly Tyr Thr Ser Lys Arg Thr Ala Leu Leu Asp Asp Ser Phe Lys 
180 . , 185 190 

Pro Ser Asp Tyr Gly lie Val Thr Lys Lys Ser Asn Thr Glu Leu Asn 
195 200 205 

Asp Tyr Leu Asp Asn Leu Val Thr Lys Trp Ser Lys Asp Gly Ser Leu 
210 215 220 

Gin Lys Leu Tyr Asp Arg Tyr Lys Leu Lys Pro Ser Ser His Thr Ala 
225 230 . . 235 240 

Asp 

(2) INFORMATION FOR SEQ ID NO: 47 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : _ 1279 base pairs . 

(B) TYPE-: nucleic, acid" ■ __ 

: — (C). STRANDEDNESS: double . - 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: ,47: ; — - 

GTCTGTATCA TTTGAAAACA AAGAAACAAA CCGTGGTGTC TTgACTTTCA CTATCTCTCA -60 

AGACCAAATC AAACCAGAAT TGGACCGTGT CTTCAAGtCA GTGAAGAAAT CTCTTAATGT 120 

TCCAGGTTTC CGTAAAGGTC ACCTTCCACG CCCTATCTTC GACCAAAAAT TTGGTGAAGA 18 0 

AGC TCTTTAT CAAGATGCAA TGAACGCACT TTTGCCAAAC GCTTATGAAG CAGCTGTAAA 240 

AGAAGCTGGT CTTGAAGTGG TTGCCCAACC AAAAATTGAC GTAACTTCAA TGGAAAAAGG 3 00 

TCAAGACTGG GTTATCACTG. CTGAAGTCGT TACAAAACCT GAAGTAAAAT TGGGTGACTA 3 60 

CAAAAACCTT GAAGTATCAG TTGATGTAGA AAAAGAAGTA ACTGACGCTG ATGTCGAAGA 420 

GCGTATCGAA CGCGAACGCA ACAACCTGGC TGAATTGGTT ATC AAGGAAG CTGCTGCTGA 48 0 

AAACGGC GAC ACTGTTGTGA TCGACTTCGT TGGTTCTATC GACGGTGTTG AATTTGACGG 540 

TGGAAAAGGT GAAAACTTCT CACTTGGACT TGGTTCAGGT CAATTCATCC CTGGTTTCGA 600 

AGACCAATTG GTAGGTC AC T CAGCTGGCGA AACCGTTGAT GTTATCGTAA CATTCCCAGA 660 
AGACTACCAA GCAGAAGACC TTGCAGGTAA AG AAGC T AAA TTCGTGACAA CTATCC AC GA - 720 

AGTAAAAGCT AAAGAAGTTC CGGCTCTTGA CGATGAACTT GCAAAAGACA TTGATGAAGA 780 

AGTTGAAACA . CTTGCTGAC T TGAAAGAAAA ATACAGCAAA GAATTGGCTG CTGCTAAAGA 840 

AGAAGCTTAC AAAGATGCAG TTGAAGGTGC AGCAATTGAT ACAGCTGTAG AAAATGCTGA 900 

AATCGTAGAA CTTCCAGAAG AAATGATCCA TGAAGAAGTT CACCGTTCAG TAAATGAATT 9 60 

CCTTGGGAAT TTGCAACGTC AAGGGATCAA CCCTGACATG TACTTC C AAA TCACTGGAAC 1020 

TACTCAAGAA GACCTTCACA ACCAATACCA AGCAGAAGCT GAGTCACGTA CTAAGACTAA 1080 

CCTTGTTATC GAAGCAGTTG CCAAAGCTGA AGGATTTGAT GCTTCAGAAG AAGAAATCCA 1140 

AAAAGAAGTT GAGCAATTGG CAGCAGACTA CAACATGGAA GTTGCACAAG TTCAAAACTT 1200 

GCTTTCAGCT GACATGTTGA AACATGATAT CACTATCAAA AAAGCTGTTG AATTGATCAC 12 60 

AAGCACAGCA ACAGTAAAA 1279 
(2) INFORMATION FOR SEQ ID NO: 48: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single _ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Ser Val Ser Phe Glu Asn Lys Glu Thr Asn Arg Gly Val Leu Thr Phe 
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15 10 15 

Thr lie Ser Gin Asp Gin lie Lys Pro Glu Leu Asp Arg Val Phe Lys 
20 25 30 

Ser Val Lys Lys Ser Leu Asn Val Pro Gly Phe Arg Lys Gly His Leu 
35 40 .45 

Pro Arg Pro He Phe Asp Gin Lys Phe Gly Glu Glu Ala Leu Tyr Gin 
50 55 .60 

Asp Ala Met Asn Ala Leu Leu Pro Asn Ala Tyr Glu Ala Ala Val Lys 
65 70 75 80 

Glu Ala Gly Leu Glu Val Val Ala .Gin Pro Lys He Asp Val Thr Ser 
85 90 95 

Met Glu Lys Gly Gin Asp Trp Val He Thr Ala Glu Val Val Thr Lys 
100 105 HO 

Pro Glu Val Lys Leu Gly Asp Tyr Lys Asn Leu Glu Val Ser Val Asp 
115 120 125. 

Val Glu Lys Glu Val Thr Asp Ala Asp Val Glu Glu Arg He Glu Arg 
130 135 140 

Glu Arg Asn Asn Leu Ala Glu Leu Val He Lys Glu Ala Ala Ala Glu 
145 150 155 160 

Asn Gly Asp Thr Val Val He Asp Phe Val Gly Ser He Asp Gly Val 
165 170 175 

Glu Phe Asp Gly Gly Lys Gly Glu Asn Phe Ser Leu Gly Leu Gly Ser 
180 185 190 

Gly Gin Phe He Pro Gly Phe Glu Asp Gin Leu Val Gly His Ser Ala 
195 . 200 205 

Gly Glu Thr Val Asp Val He Val Thr Phe Pro Glu Asp Tyr Gin Ala 
210 215 220 

Glu Asp Leu Ala Gly Lys Glu Ala Lys Phe Val Thr Thr lie His Glu 
225 230 235 240 

Val Lys Ala Lys Glu Val Pro Ala Leu Asp Asp Glu Leu Ala Lys Asp 
245 250 255 

He Asp Glu Glu Val Glu Thr Leu Ala Asp Leu Lys Glu Lys Tyr Ser 
260 265 270 

Lys Glu Leu Ala Ala Ala Lys Glu Glu Ala Tyr Lys Asp Ala Val Glu 
275 280 285 

Gly Ala Ala He Asp Thr" Ala Val Glu Asn Ala Glu He Val Glu Leu 
290 295 300 

Pro Glu Glu Met He His Glu Glu. Val His Arg Ser Val Asn Glu Phe 
305 310 . 315 320 

Leu_Gly-Asn__Leu_Gln_.Arg Gin Gly He Asn Pro Asp Met Tyr Phe Gin 

3 25 .,3 30 . 335 

He Thr Gly Thr Thr Gin Glu Asp Leu His Asn Gin Tyr Gin Ala Glu 
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340 345 350. 

Ala Glu Ser Arg Thr Lys Thr Asn Leu Val lie Glu Ala Val Ala Lys 

355 360 - - - 365 

Ala Glu Gly Phe Asp Ala Ser Glu Glu Glu lie Gin Lys Glu Val Glu 

370 375 _ 380 

. Gin Leu Ala Ala Asp Tyr Asn Met Glu Val Ala Gin Val Gin Asn Leu 
, 385 - 390 395 400 

Leu Ser Ala Asp Met Leu Lys His Asp lie Thr lie Lys Lys Ala Val 
405 410 .415 

Glu Leu lie Thr Ser Thr Ala Thr Val Lys 
420 425 

(2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) "" LENGTH ;-490 " base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

TGGTCAAAAG GAAAGTCAGA CAGGAAAGGG GATGAAAATT GTGACCAGTT TTTATC CTAT 60 

CTACGCTATG GTTAAGGAAG TATC TGGTGA CTTGAATGAT GTTC GGATG A TTCAGTCAAG 120 

TAGTGGTATT CACTCCTTTG AACCTTCGGC AAATGATATC GCAGCCATCT ATGATGCAGA 180 

TGTC TTTGTT TACCATTCTC ATACACTCGA ATCTTGGGCA GGAAGTC TGG ATCCAAATCT 240 

AAAAAAATCC AAAGTGAAGG TCTTAGAGGC TTCTGAGGGA ATGAC C TTGG AACGTGTCCC 3 00 

TGGACTAGAG GATGTGGAAG CAGGGGATGG AGTTGATGAA AAAACGCTCT ATGACCCTCA 3 60 

CACATGGCTA GATCCTGAAA AAGCTGGAGA AGAAGCCCAA ATTATCGCTG ATAAACTTTC 420 



AGAGGTGGAT AGTGAGCATA AAGAGACTTA TCAAAAAAAT GCGCAACCTT TATCAAAAAA 
GCTCAGGAAT 

r 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: "protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Gly Gin Lys Glu Ser Gin Thr Gly Lys Gly Met Lys lie Val Thr Ser 

— i .5 10 ...„.' -15 
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Phe Tyr Pro lie Tyr Ala Met Val Lys Glu Val Ser Gly Asp Leu Asn 

20 .. _ 25 . .30 

. Asp Val Arg Met lie Gin Ser Ser Ser Gly lie His. Ser^Phe Glu Pro 
__ 3 5 40 45 

Ser Ala Asn Asp lie Ala Ala lie Tyr Asp Ala Asp Val Phe Val Tyr. 
50 55 60 

His Ser His Thr Leu Glu Ser Trp Ala Gly Ser Leu Asp Pro Asn Leu 
65 70 75 80 

Lys Lys Ser Lys Val . Lys Val Leu Glu Ala Ser Glu Gly Met Thr Leu 

85. 90 95. 

Glu Arg Val Pro Gly Leu Glu Asp Val Glu Ala Gly Asp Gly Val Asp 
100 . 105 110 . 

Glu Lys Thr Leu Tyr Asp Pro His Thr Trp Leu Asp Pro Glu Lys Ala 

O • 115 120 125 ~- - - 

XI Gly Glu Glu Ala Gin lie lie Ala Asp Lys Leu Ser Glu Val Asp Ser 

i]P 13 0 13 5 - 140 

Glu His Lys Glu Thr Tyr Gin Lys Asn Ala Gin Pro Leu Ser Lys Lys 
145 150 155 160 

—J: 

s=as Leu Arg Asn 

M (2) INFORMATION FOR SEQ ID NO: 51: 

p=J. (i) SEQUENCE CHARACTERISTICS: ' ; 

•=5 (A) LENGTH : 1006 base pairs 

(B). TYPE: nucleic acid 
^ (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GAAGGATAGA TATATTTTAG CATTTGAGAC ATCCTGTGAT GAGACCAGTG TCGCCGTCTT 60 

G AAAAAC G AC GATGAGCTCT TGTCCAATGT CATTGCTAGT CAAATTGAGA GTCACAAACG 12 0 

TTTTGGTGGC GTAGTGCCCG AAGTAGCCAG TCGTCACCAT GTCGAGGTCA TTACAGCCTG 180 

TATCGAGGAG GCATTGGCAG AAGCAGGGAT TACCGAAGAG GACGTGACAG CTGTTGCGGT 240 

TACCTAC GGA CCAGGCTTGG TCGGAGCCTT GCTAGTTGGT TTGTCAGCTG CCAAGGCCTT 300 

TGCTTGGGCT CACGGACTTC CACTGATTCC TGTTAATCAC ATGGCTGGGC ACCTCATGGC 3 60 

AGCTCAGAGT GTGGAGCCTT TGGAGTTTCC CTTGCTAGCC CTCTTGGTCA GCGGCGGACA 42 0 

CACAGAGTTG GTTTATGTTT CGGAGGCAGG AGATTATAAG ATTGTTGGGG AAACCCGTGA 480 

TGATGCGGTT GGTGAGGCTT ATGATAAGGT CGGCCGTGTC ATGGGCTTGA CCTATCCTGC 540 
AGGTCGTGAG ATTGACGAGC TGGCTCATCA GGGGCAGGAT ATTTATGATT TCCCCCGTGC__ . 600 

CATGATTAAG GAAGATAATC TGGAGTTCTC CTTCTCAGGT TTGAAATCTG CCTTTATCAA 660 
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TCTTCATCAC AATGCCGAGC AAAAGGGAGA AAGCCTGTCT ACAGAAGATT TGTGTGCTTC 

CTTCCAAGCA GCAGTTATGG ACATTCTCAT GGCAAAAACC AAGAAGGCTT TGGAGAAATA 

TCCTGTTAAA ATCCTAGTTG TGGCAGGTGG TGTGGCAGCC AATAAAGGTC TC AGAGAAC G 

CCTAGCAGCC G AAAT C AC AG ATGTCAAGGT TATCATCCCC CCTCTGCGAC TCTGCGGAGA 

CAATGCAGGT ATGATTGCCT ATGCCAGCGT CAGCNAGTGG AACAAAGAAA ACTTCGCAGG 

CTGGGACCTC. AATGCCAAAC CAAGTCTTGC CTTTGATACC ATGGAA 
(2) INFORMATION FOR SEQ ID NO: 52: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Lys Asp Arg Tyr lie Leu Ala Phe Glu Thr Ser Cys Asp Glu Thr Ser 
1 5 10 15 

Val Ala Val Leu Lys Asn Asp Asp Glu Leu Leu Ser Asn Val lie Ala 
20 25 30 

Ser Gin lie Glu Ser His Lys Arg Phe Gly Gly Val Val Pro Glu Val 
35 40 45 

Ala Ser Arg His His Val Glu Val lie Thr Ala Cys lie Glu Glu Ala 
50 55 60 

Leu Ala Glu Ala Gly lie Thr Glu Glu Asp Val Thr Ala Val Ala Val 
65 • 70 75 80 

Thr Tyr Gly Pro Gly Leu Val Gly Ala Leu Leu Val Gly Leu Ser Ala 
85 90 95 

Ala- Lys Ala Phe Ala Trp Ala His Gly Leu Pro Leu lie Pro Val Asn 
100 105 110 

His Met Ala Gly His Leu Met Ala Ala Gin Ser Val Glu Pro Leu Glu 
115 .120 125 

Phe Pro Leu Leu Ala Leu Leu Val Ser Gly Gly His Thr Glu Leu Val 
130 135 140 . 

Tyr Val Ser Glu Ala Gly Asp Tyr Lys lie Val Gly Glu Thr Arg Asp 
145 150 155 160 

Asp Ala Val Gly Glu Ala Tyr Asp Lys Val Gly Arg Val Met Gly Leu 
165 ; 170 175 

Thr „Tyr__P.r.o_Ala„Gly__ Arg Glu lie Asp Glu Leu Ala His Gin Gly Gin 

180 185 ... 190 

Asp lie Tyr Asp Phe Pro Arg Ala Met lie Lys Gflu Asp Asn Leu Glu 
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200 



205 



Phe Ser Phe Ser 

210 . 

Ala Glu Gin Lys 
225 

Phe Gin Ala Ala 



Leu Glu Lys Tyr 
260 

Ala Asn Lys Gly 
275 



Gly Leu Lys Ser 
215 

Gly Glu Ser Leu 
230 

Val Met Asp lie 
245 

Pro Val Lys lie 



Leu Arg Glu Arg 
280 



Ala Phe lie Asn 
220 

Ser Thr Glu Asp 
235 

Leu Met Ala Lys 
250 

Leu Val Val Ala 
265 

Leu Ala Ala Glu 



Leu His His Asn 



Leu Cys Ala Ser 
240 

Thr Lys Lys Ala 
255 

Gly Gly Val Ala 
270 

lie Thr Asp Val 
285 



Lys Val He He Pro Pro Leu Arg Leu Cys Gly Asp Asn Ala Gly Met 
290 295 300 



lie "Ala Tyr Ala— Ser Val— Ser Xaa Trp Asn Lys Glu Asn Phe Ala Gly 

305 310 315 320 

Trp Asp Leu Asn Ala Lys Pro Ser Leu Ala Phe Asp Thr Met Glu 
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1000 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GGTAGTTAAA GTTGGTATTA ACGGTTTCGG ACGTATCGGT CGTCTTGCTT TCCGTCGTAT 60 

CCAAAACGTA GAAGGTGTTG AAGTTACACG CATCAACGAC CTTACAGATC CAGTTATGCT 120 

TGCACACTTG TTGAAATACG ACACAACTCA AGGTCGTTTC GACGGTACTG TTGAAGTTAA 180 

AGAAGGTGGA TTTGAAGTTA ACGGTAAATT CATCAAAGTT TCTGCTGAAC GTGATCCAGA 240 

AC AAATC GAC TGGGCTACTG ACGGTGTAGA AATCGTTCTT GAAGCTACTG GTTTCTTTGC 3 00 

TAAGAAAGAA GCAGCTGAAA AACACCTTAA AGGTGGAGCT AAAAAAGTTG TTATCACTGC 3 60 

TCCTGGTGGA AACGACGTTA AAACAGTTGT ATTCAACACT AACCACGACG TTCTTGACGG 420 

TACTGAAACA GTTATCTCAG GTGCTTCATG TACTACAAAC TGCTTGGCTC CAATGGCTAA 48 0 

AGCTCTTCAA GACAACTTTG GTGTTGTTGA AGGATTGATG ACTACTATCC ACGCTTACAC 540 

TGGTGACCAA ATGATCCTTG ACGGACCACA CCGTGGTGGT GACCTTCGCC GTGCTCGCGC 6 00 

TGGTGCTGCA AACATCGTTC CTAACTCAAC TGGTGCTGCA AAAGCTATCG GTCTTGTAAT 660 



CCCAGAATTG AATGGTAAAC TTGACGGATC TGCACAACGC GTTCCAACTC CAACTGGATC ,72 0 

AGTTACTGAA TTGGTAGCAG TTCTTGAAAA GAACGTTACT GTTGATGAAG TGAACGCAGC 78 0 
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TATGAAAGCA GCTTCAAACG AATCATACGG TTACACAGAA GATCCAATCG TATCTTCAGA 840 

TATCGTAGGT ATGTCTTACG GTTCATTGTT TGACGCAACT CAAACTAAAG TTCTTGACGT 900 

TGAC GGT AAA CAATTGGTTA AAGTTGTATC ATGGTACGAC AACGAAATGT CAT AC AC TGC 960 

ACAACTTGTT CGTACTCTTG GAATACTTCG CAAAAATTGC 1000 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 : , 

Val Val Lys Val Gly lie Asn Gly Phe Gly Arg lie Gly Arg Leu Ala 
1 5 10 15 

Phe Arg Arg lie Gin Asn Val Glu Gly Val Glu Val Thr Arg lie Asn 
20 25 30 



to 



Asp Leu Thr Asp Pro Val Met Leu Ala His Leu Leu Lys Tyr Asp Thr 
35 40 45 

Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly Phe 
50 55 60 

Glu Val Asn Gly Lys Phe lie Lys Val Ser Ala Glu Arg Asp Pro Glu 
65 70 75 80 

Gin He Asp Trp Ala Thr Asp Gly Val Glu He Val' Leu Glu Ala Thr 
85 90 95 

Gly Phe Phe Ala Lys Lys Glu Ala Ala Glu Lys His Leu Lys Gly "Gly 
100 105 110 

Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asn Asp Val Lys Thr 
115 120 125 

- Val Val Phe Asn Thr Asn His Asp Val Leu Asp Gly Thr Glu Thr Val 
130 135 140 . 

He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met Ala Lys 
145 " 150 155 -160 

Ala Leu Gin Asp Asn Phe Gly Val Val Glu Gly Leu Met Thr Thr He 
165 170 175 

His Ala Tyr Thr Gly Asp Gin Met He Leu Asp Gly Pro His Arg Gly 
180 185 190 

Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn He Val Pro Asn 
195 200 205 



Ser Thr Gly Ala Ala Lys Ala lie Gly Leu-Val He Pro Glu Leu Asn 
210 215 220 
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-Gly Lys Leu Asp Gly Ser Ala Gin Arg Val Pro Thr Pro Thr Gly Ser 

225 230 235 240 

Val Thr Glu Leu Val Ala Val Leu Glu Lys Asn Val Thr Val Asp Glu 
245 250 255 

Val Asn Ala Ala Met Lys Ala Ala Ser Asn Glu Ser Tyr Gly.. Tyr Thr 

260 265 270 

Glu Asp Pro lie Val Ser Ser Asp lie Val Gly Met Ser Tyr Gly Ser 

275 280 . 285 

Leu Phe Asp Ala Thr Gin Thr Lys Val Leu Asp Val Asp Gly Lys Gin 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

TTCTTAC GAG TTGGGACTGT ATCAAGCTAG AACGGTTAAG GAAAATAATC GTGTTTCCTA 60 

TATAGATGGA AAACAAGCGA CGCAAAAAAC GGAGAATTTG ACTCCTGATG AGGTTAGCAA 12 0 

GCGTGAAGGA ATCAATGCTG AGCAAATCGT CATCAAGATA ACAGACCAAG GCTATGTCAC 180 

TTCACATGGC GAC CACTATC ATTATTACAA TGGTAAGGTT CCTTATGACG CTATCATCAG 240 

TGAAGAATTA CTCATGAAAG ATCCAAACTA TAAGCTAAAA GATGAGGATA TTGTT AATG A 300 

GGTCAAGGGT GGATATGTTA TCAAGGTAGA TGGAAAATAC TATGTTTACC TTAAGGATGC 3 60 

TGCCCACGCG GATAACGTCC GTACAAAAGA GGAAATCAAT CGACAAAAAC AAGAGCATAG 420 

TCAACATCGT GAAGGTGGAA CTCCAAGAAA CGATGGTGCT GTTGCCTTGG CACGTTCGCA 48 0 

AGGACGCTAT ACTACAGATG ATGGTTATAT CTTTAATGCT TCTGATATCA TAGAGGATAC 540 

TGGTGATGCT TATATCGTTC CTCATGGAGA TCATTACCAT TACATTCCTA AGAATGAGTT 600 

ATCAGCTAGC GAGTTGGCTG CTGCAGAAGC CTTCCTATCT GGTC GAGGAA ATCTGTCAAA 660 

TTCAAGAACC TATCGCCGAC AAAATAGCGA TAACACTTCA AGAACAAACT GGGTAC CTTC 720 

TGTAAGCAAT CCAGGAACTA CAAATACTAA CACAAGCAAC AACAGCAACA CTAACAGTCA 780 

AGCAAGTCAA AGTAATGACA TTGATAGTCT CTTGAAACAG CTCTACAAAC TGCCTTTGAG 84 0 

TCAACGACAT GTAGAATCTG ATGGCCTTGT CTTTGATCCA. .GCACAAATCA CAAGTCGAAC 900 

AGCTAGAGGT GTTGCAGTGC CACACGGAGA TCATTACCAC . TTCATCCCTT ACTCTCAAAT 960 
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GTCTGAATTG GAAGAACGAA TCGCTCGTAT TATTCCCCTT CGTTATCGTT CAAACCATTG 102 5 

GGTACCAGAT TCAAGGCCAG _AACAACCAAG TCCACAACCG ACTCCGGAAC CTAGTCCAGG 108 0 

CCGGCAACCT GCACCAAATC TTAAAATAGA CTCAAATTCT TCTTTGGTTA GTCAGCTGGT 114 0 
AC G AAAAGTT GGGGAAGGAT ATGTATTCGA AGAAAAGGGC ATCTCTCGTT ATGTC TTTGC ,1200 

GAAAGATTTA CCATCTGAAA CTGTTAAAAA TCTTGAAAGC AAGTTATCAA AACAAGAGAG 12 60 

TGTTTCACAC ACTTTAACTG CTAAAAAAGA AAATGTTGCT CCTCGTGACC AAGAATTTTA 13 2 0 

TGATAAAGCA TATAATCTGT TAACTGAGGC TCATAAAGCC TTGTTTGNAA ATAAGGGTCG 13 80 
TAATTCTGAT TTCCAAGCCT TAGACAAATT ATTAGAACGC TTGAATGATG AATCGACTAA ' 144 0 

TAAAGAAAAA TTGGTAGATG ATTTATTGGC ATTCCTAGCA CCAATTACCC ATCCAGAGCG 1500 

ACTTGGCAAA CCAAATTCTC AAATTGAGTA TACTGAAGAC GAAGTTC GTA TTGCTCAATT 1560 

AGCTGATAAG T AT AC AAC GT CAGATGGTTA CATTTTTGAT GAACATGATA TAATCAGTGA 162 0 

TGAAGGAGAT GCATATGTAA CGCCTCATAT GGGCCATAGT CACTGGATTG GAAAAGATAG 168 0 

CCTTTCTGAT AAGGAAAAAG TTGCAGCTCA AGCCTATACT AAAGAAAAAG GTATCCTACC 1740 

TCCATCTCCA GACGCAGATG TTAAAGCAAA TCCAACTGGA GATAGTGCAG CAGCTATTTA 18 00 

CAATCGTGTG AAAGGGGAAA AACGAATTCC ACTCGTTCGA CTTCCATATA TGGTTGAGCA 18 60 

TACAGTTGAG GTTAAAAACG GTAATTTGAT TATTCCTCAT AAGGATCATT ACCATAATAT 1920 

TAAATTTGCT TGGTTTGATG ATCACACATA CAAAGCTCCA AATGGCTATA CCTTGGAAGA 198 0 

TTTGTTTGCG ACGATTAAGT ACTAC GTAGA ACACCCTGAC GAACGTCCAC ATTCTAATGA 2 040 

TGGATGGGGC AATGCCAGTG AGCATGTGTT AGGCAAGAAA GACCACAGTG AAGATCCAAA 2100 

TAAGAACTTC AAAGC GGATG AAGAGCCAGT AGAGGAAACA CCTGCTGAGC CAGAAGTCCC 2160 

TCAAGTAGAG ACTGAAAAAG TAGAAGCCCA ACTCAAAGAA GCAGAAGTTT TGCTTGCGAA 222 0 

AGTAACGGAT TCTAGTCTGA AAGCCAATGC AACAGAAACT CTAGCTGGTT TACGAAATAA 2280 
TTTGACTCTT CAAATTATGG ATAACAATAG TATCATGGCA GAAGGAGAAA AATTAC TTGC . 234 0 

GTTGTTAAAA GGAAGTAATC CTTCATCTGT AAGTAAGGAA AAAATAAAC 238 9 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi)__SEQUENCE DESCRIPTION i_ SEQ ID NO: 56: ~ 
Ser Tyr Glu Leu Gly Leu Tyr Gin Ala Arg Thr Val Lys Glu Asn Asn 



1 



5 



10 



15 



Arg Val Ser Tyr lie Asp Gly Lys Gin Ala Thr Gin Lys Thr Glu Asn 
20 •-- 25 . 30 

Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly lie Asn Ala Glu Gin 

35 40 45 

lie Val lie Lys lie Thr Asp Gin Gly Tyr Val Thr Ser His Gly Asp 
50 55 60 

His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala He He Ser 
. 65 70 75 80 

Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Lys Leu Lys Asp Glu Asp 

85 90 95 

He Val Asn Glu Val Lys Gly Gly Tyr Val He Lys Val Asp Gly Lys 
100 105 110 



Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala AspAsh Val" Arg Thr 
115 120 125 

Lys Glu Glu He Asn Arg Gin Lys Gin Glu His Ser Gin His Arg Glu 
130 135 * 140 

Gly Gly Thr Pro Arg Asn Asp Gly Ala Val Ala Leu Ala Arg Ser Gin 
145 150 155 160 

Gly Arg Tyr Thr Thr Asp Asp Gly Tyr He Phe Asn Ala Ser Asp He 
165 170 ' 175 

He Glu Asp Thr Gly Asp Ala Tyr He Val Pro His Gly Asp His Tyr 
180 185 190 

His Tyr He Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala Ala Ala 
195 200 205 

Glu Ala Phe Leu Ser Gly Arg Gly Asn Leu Ser Asn Ser Arg Thr Tyr 
210 215 220 

Arg Arg Gin Asn Ser Asp Asn Thr Ser Arg Thr Asn Trp Val Pro Ser 
225 230 235 240 

Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser Asn Asn Ser Asn 
245 250 255 

Thr Asn Ser Gin Ala Ser Gin Ser Asn Asp He Asp Ser Leu Leu Lys 
260 265 270 

Gin Leu Tyr Lys Leu Pro Leu Ser Gin Arg His Val Glu Ser Asp Gly 

275 280 285 . . . 

Leu Val Phe Asp Pro Ala Gin He Thr Ser Arg Thr Ala Arg Gly Val 
290 295 300 

Ala Val Pro His Gly Asp His Tyr His Phe He Pro Tyr Ser Gin Met 
305 310 315 320 

Ser Glu Leu Glu Glu Arg He Ala Arg He He Pro Leu Arg Tyr Arg 
325 330 335 



Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gin Pro Ser Pro Gin 



340 



167 
345 



350 



Pro Thr Pro Glu 
355 

lie Asp Ser Asn 
370 

Glu Gly Tyr Val 
385 

Lys Asp Leu Pro 



Lys Gin Glu Ser 
420 

Ala Pro Arg Asp 
435 

Glu Ala His Lys 
450 



Pro Ser Pro Gly 
360 

Ser Ser Leu Val 
375 

Phe Glu Glu Lys 
390 

Ser Glu Thr Val 
405 

Val Ser His Thr 



Gin Glu Phe Tyr 
440 

Ala Leu Phe Xaa 
455 



Pro Gin Pro Ala 



Ser Gin Leu Val 
380 

Gly . lie Ser Arg 
395 

Lys Asn Leu Glu 
410 

Leu Thr Ala Lys 
425 

Asp Lys Ala Tyr 



Asn Lys Gly Arg 
460 



Pro Asn Leu Lys 
365 

Arg Lys Val Gly 



Tyr Val Phe Ala 
400 

Ser Lys Leu Ser 
415 

Lys Glu Asn Val 
430 

Asn Leu Leu Thr 
445 

Asn Ser Asp Phe 



Gin Ala Leu Asp Lys Leu Leu Glu Arg Leu Asn Asp Glu Ser Thr Asn 

465 470 475 480 

Lys Glu Lys Leu Val Asp Asp Leu Leu Ala Phe Leu Ala Pro lie Thr 

485 490 495 

His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gin lie Glu Tyr Thr Glu 

500 505 510 

Asp Glu Val Arg lie Ala Gin Leu" Ala Asp Lys Tyr Thr Thr Ser Asp 

515 520 525 

Gly Tyr lie Phe Asp Glu His Asp lie lie Ser Asp Glu Gly Asp Ala 

530 535 540 

Tyr Val Thr Pro His Met Gly His Ser His Trp He Gly Lys Asp Ser 

545 550 555 . 560 

Leu Ser Asp Lys Glu Lys Val Ala Ala Gin Ala Tyr Thr Lys Glu Lys 

565 570 575 

Gly He Leu Pro Pro Ser Pro Asp Ala Asp Val Lys Ala Asn Pro Thr 

580 585 " 590 

Gly Asp Ser Ala Ala Ala He Tyr Asn Arg Val Lys Gly Glu Lys Arg 

595 600 605 

He Pro Leu Val Arg Leu Pro Tyr Met Val Glu His Thr Val Glu Val 

610 615 620 

Lys Asn Gly Asn Leu He He Pro His Lys Asp His Tyr His Asn He 

625 _ 630 635 640 

Lys Phe Ala Trp Phe Asp Asp His Thr Tyr Lys Ala Pro Asn Gly Tyr 

645 650 655 

-Thr-Leu-G-lu-Asp- Leu Phe Ala Thr He Lys Tyr Tyr Val Glu His Pro 

660 - 665 670 

Asp Glu Arg Pro His Ser Asn Asp Gly Trp Gly Asn Ala Ser Glu His 
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. 675 680 685 

Val Leu Gly Lys Lys Asp His Ser Glu Asp Pro Asn Lys Asn Phe Lys 
690 695 700 

Ala Asp Glu Glu Pro Val Glu Glu Thr Pro Ala Glu Pro Glu Val Pro 
705 710 715 720 

Gin Val Glu Thr Glu Lys Val Glu Ala Gin Leu Lys Glu Ala Glu Val 
725 730 735 

Leu Leu Ala Lys Val Thr Asp Ser Ser Leu Lys Ala Asn Ala Thr Glu 
740 745 750 

Thr Leu Ala Gly Leu Arg Asn Asn Leu Thr Leu Gin lie Met Asp Asn 
755 760 765 

Asn Ser lie Met Ala Glu Ala Glu Lys Leu Leu Ala Leu Leu Lys Gly 
770 _ 775 780 

Ser Asn : ~Pf6""Ser " Sef~Val Ser Lys Glu Lys He Asn 
785 790 795 

INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
TACTGAGATG CATCATAATC TAGGAGCTGA AAAGCGTTCA GCAGTGGCTA CTACTATCGA 60 

TAGTTTTAAG GAGCGAAGTC AAAAAGTCAG AGCACTATCT GATCCAAATG TGCGTTTTGT 120 

TCCCTTCTTT GGCTC TAGTG AATGGCTTCG TTTTGACGGT GCTCATTCTG CGGTATTAGC 180 

TGAGAAATAC AATCGTTCCT ACCGTCCTTA TCTTTTAGGA CAGGGGGGAG CTGCATCGCT 240 

TAACCAATAT TTTGGAATGC AACAGATGTT ACCACAGCTG GAGAATAAAC AAGTTGTGTA 3 00 
TGTTATCTCA CCTCAGTGGT TCAGTAAAAA TGGCTATGAT CCAGCAGCCT TCCAGCAGTA _ 3 60 

TTTTAATGGA GACCAGTTGA CTAGTTTTCT GAAACATCAA TCTGGGGATC AGGCTAGTCA 420 

ATATGCAGCG ACTCGCTTAC TGCAACAGTT CCCAAACGTA GCTATGAAGG ACCTGGTTCA 48 0 

GAAGTTGGCA AGTAAAGAAG AATTGTCGAC AGCAGACAAT GAAATGATTG AATTATTGGC 540 

TCGTTTTAAT GAACGC CAAG CTTCCTTTTT TGGTCAGTTT TCGGTTAGAG GCTATGTTAA 600 

CTACGATAAG CATGTAGCTA AGTATTTAAA AATCTTGCCA GAC C AGTTTT CTTATCAGGC 660 

AATAGAAGAT GTTGTCAAAG ^CAGATGCTGA AAAAAATACT TCCAATAATG AGATGGGAAT ~ 720 

- GGAAAATTAT TTCTATAATG AGCAGATCAA GAAGGATTTG AAGAAAT-TAA-AGGATTCTCA 780 

GAAAAGCTTT ACCTATCTCA AGTCGC CAGA GTATAATGNN TTGCAGTTGG TTTTAACACA 840 
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GTTTTCTAAA TCTAAGGTAA ACCCGATTTT TATCATTCCA CCTGTTAATA AAAAATGGAT 90p 
GNACTATGCT GGTCTACGAG AGGATATGTA CCAACAAACG GTGCAGAAGA TTCGCTACCA 960 



GTTAGAAAGT CAAGGTTTTA CCAATATAGC AGATTTTTCT AAGGACGGCG GGGAGCCTTT 1020 

CTTTATGAAG GACACCATTC ACCTTGGTTG GTTGGGTTGG TTGGCTTTTG ACAAGGCAGT _ 1080 

TGATCCTTTC CTATCCAATC CCACACCAGC TCCGACTTAC CATCTGAATG AGCGCTTTTT 1140 

CAGCAAAGAT TGGGCGACTT ATGATGGAGA TGTCAAAGAA 1180 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein * " — ~ — ■ -- = 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Thr Glu Met His His Asn Leu Gly Ala Glu Lys Arg Ser Ala Val Ala 
15 10 15 

Thr Thr lie Asp Ser Phe Lys Glu Arg Ser Gin Lys Val Arg Ala Leu 
20 25 30 

Ser Asp Pro Asn Val Arg Phe Val Pro Phe Phe Gly Ser Ser Glu Trp 
35 40 45 

Leu Arg Phe Asp Gly Ala His Ser Ala Val Leu Ala Glu Lys Tyr Asn 
50 55 60 

Arg Ser Tyr Arg Pro Tyr Leu Leu Gly Gin Gly Gly Ala Ala Ser Leu 
65 70 75 80 

Asn Gin Tyr Phe Gly Met Gin Gin Met Leu Pro Gin Leu Glu Asn Lys 
85 90 95 

Gin Val Val Tyr Val lie Ser Pro Gin Trp Phe Ser Lys Asn Gly Tyr 
100 105 110 

Asp Pro Ala Ala Phe Gin Gin Tyr Phe Asn Gly Asp Gin Leu Thr Ser 
115 120 125 

Phe Leu Lys His Gin Ser Gly Asp Gin Ala Ser Gin Tyr Ala Ala Thr 
130 135 140 

Arg Leu Leu Gin Gin Phe Pro Asn Val Ala Met Lys Asp Leu- Val Gin 
145 150 155 160 

Lys Leu Ala Ser Lys Glu Glu Leu Ser Thr Ala Asp Asn Glu Met lie 
165 170 175 

Glu Leu Leu- Ala Arg Phe Asn Glu Arg Gin Ala Ser Phe Phe" Gly Gin* 
180 185 .. _ . 190 



Phe 



Ser 
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3d 



Leu Lys lie Leu Pro Asp Gin Phe . Ser Tyr Gin Ala lie Glu Asp Val 
210 215 • 220 

Val Lys Ala Asp Ala Glu Lys Asn Thr Ser Asn Asn Glu Met Gly Met 
225 230 235 240 

Glu- Asn Tyr Phe Tyr Asn Glu Gin lie Lys Lys Asp Leu Lys Lys Leu' 
245 250 255 

Lys Asp Ser Gin Lys Ser Phe Thr Tyr Leu Lys Ser Pro Glu Tyr Asn 
260 265 270 

Xaa Leu Gin Leu Val Leu Thr Gin Phe Ser Lys Ser Lys Val Asn Pro 
275 280 285 

lie Phe lie lie Pro Pro Val Asn Lys Lys Trp : Met Xaa Tyr Ala Gly 
290 295 300 

Leu Arg Glu Asp Met Tyr Gin Gin Thr Val Gin Lys lie Arg Tyr Gin 
305 310 315 320 

Leu Glu Ser Gin Gly Phe Thr Asn He Ala Asp Phe Ser Lys Asp Gly 
325 330 335 

Gly Glu Pro Phe Phe Met Lys Asp Thr He His Leu Gly Trp Leu Gly 
340 345 350 

Trp Leu Ala Phe Asp Lys Ala Val Asp Pro Phe Leu Ser Asn Pro Thr 
355 360 365 

Pro Ala Pro Thr Tyr His Leu Asn Glu Arg Phe Phe Ser Lys Asp Trp 
370 375 380 

Ala Thr Tyr Asp Gly Asp Val Lys Glu 
385 ' 390 

(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



GGTTTTGAGA 


AAGTATTTGC 


AGGGGGCCCT 


GATTGAGTCG 


ATTGAGCAAG 


TGGAAAATGA 


60 


CCGTATTGTG 


GAAATTACAG 


TTTC C AATAA 


AAAC GAG ATT 


GGAGACCATA 


TCCAGGCTAC 


120 


CTTGATTATC 


GAAATTATGG 


GGAAACACAG 


TAATATTCTA 


CTGGTC GATA 


AAAGCAGTCA . 


180 


TAAAATC CTC 


GAAGTTATCA 


AACACGTCGG 


CTTTTCACAA 


AATAGCTACC 


GCACCTTACT 


240 


TCCAGGATCG 


ACCTATATCG 


CTCCGCCAAG 


TACAAAATCT 


CTCAATCCTT 


TTACTATCAA 


300 


GGATGAAAAG 


CTCTTTGAAA 


TCC TGCAAAC 


CCAAGAACTA 


ACAGCAAAAA 


ATCTTCAAAG 


360 


CCTCTTTCAA 


GGTCTGGGAC 


GCGATACGGC 


AAATGAATTG 


GAAAGGATAC 


TGGTTAGTGA 


420„ 



AAAACTTTCC GCTTTCCGAA ATTTTTTCAA TCAAGAAACC AAGCGATGCT TGACTGAGAC 
TTCCTTCAGT CCAGTTCCTT TTGCAAATCA GGTGGGAGAG CCTTTTGCAA ATCTTTCTGA 
TTTGTTGGAC ACCTAC TATA AGGATAAGGC TGAGCGCGAC CGCGTCAAAC AGCAGGCCAG 
TGAACTGATT CGTCGTGTTG AAAATGAACT TCAGAAAAAC C G AC AC AAAC TCAAAAAACA 
GGAAAAAGAG TTACTGGCGA C AG AC AAC GC TGAAGAATTT CGTCAAAAAG GAGAATTGCT 
GACAACCTTC CTCCACCAAG TGCCTAACGA CCAAGACCAG GTTATCCTAG ACAACTACTA 
TACCAACCAA CCTATCATGA TTGCGCTTGA TAAGGCTCTG ACTCCCAACC AGAATGCCCA 
ACGCTATTTT AAAC GGTATC AGAAACTCAA AG AAGC TGTC AAATACTTGA CTGATTTGAT 
TGAAGAAACC AAAGCCACTA TTCTCTATCT GGAAAGTGTA GAAACCGTCC TCAACCAAGC 
TGGACTGGAA GAAATCGCTG AAATCCGTGA AGAATTGATT CAAACAGGTT TTATC C GC AG 



\AAGAC AAC GG~GAGAAAATCC "~~ AGAAACGC AA AAAACTAGAA CAATATCTAG CAAGCGATGG 

C AAAAC CATC ATCTATGTCG GACGAAACAA TCTTCAAAAT GAGGAATTGA CCTTTAAAAT 

GGCCCGCAAG GAGGAACTTT GGTTCCATGC TAAGGACATT CCTGGAAGCC ATGTTGTCAT 

CTCAGGAAAT C TTGACC C AT CTGATGCAGT CAAGACAGAC GCAGCAGAGT TAGCTGCCTA 

CTTCTCTCAA GGGCGCCTGT CGAATCTGGT GCAGGTAGAT ATGATTGAAG TCAAAAAACT 

CAATAAACCA ACTGGTGGAA AACCCGGCTT TGTC AC TTAC ACAGGACAAA AGACCCTCCG 

CGTCACACCA GACTCCAAAA AAATTGCATC CATGAAAAAA TCC 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 amino acids 

(B) TYPE: amino acid 

- .(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Val Leu Arg Lys Tyr Leu Gin Gly Ala Leu lie Glu Ser lie Glu Gin 
1 5 10 15 

Val Glu Asn Asp Arg lie Val Glu lie Thr Val Ser Asn Lys Asn Glu 

20 25 30 ' 

lie Gly Asp His lie Gin Ala Thr Leu lie lie Glu lie Met Gly Lys 
35 40 45 

His Ser Asn lie Leu Leu Val Asp Lys Ser Ser His Lys lie Leu Glu 
50 55 60 

Val lie Lys His Val Gly Phe Ser Gln-Asn-Ser— Tyr~Arg-Thr Leu Leu 
- -65 ' ~ 70 75 "80 

Pro Gly Ser Thr Tyr lie Ala Pro Pro Ser Thr Lys Ser Leu Asn Pro 



85 



172 



90 



95 



Phe Thr lie Lys Asp Glu Lys Leu Phe Glu lie Leu Gin Thr Gin Glu 

100 - - - - 105 ■ 110 



Leu Thr Ala Lys Asn Leu Gin Ser Leu Phe Gin Gly Leu Gly Arg Asp 
115, 120 .125 

Thr Ala Asn Glu Leu Glu Arg He Leu Val Ser Glu Lys Leu Ser Ala 
130 135 140 

Phe Arg Asn Phe Phe Asn Gin Glu Thr Lys Pro Cys Leu Thr Glu Thr 
145 150 155 160 

Ser Phe Ser Pro Val Pro Phe Ala Asn Gin Val Gly Glu Pro Phe Ala 
165 170 175 

Asn Leu Ser Asp Leu Leu Asp Thr Tyr Tyr Lys Asp Lys Ala Glu Arg 
180 185 190 

Asp Arg Val Lys Gin Gin Ala Ser Glu Leu lie Arg Arg Val~Glu Ash' 
195 200 205 

Glu Leu Gin Lys Asn Arg His Lys Leu Lys Lys Gin Glu Lys Glu Leu 
210 215 220 

Leu Ala Thr Asp Asn Ala Glu Glu Phe Arg Gin Lys Gly Glu Leu Leu 
225 230 235 240 

Thr Thr Phe Leu His Gin Val Pro Asn Asp Gin Asp Gin Val He Leu 
245 250 255 

Asp Asn Tyr Tyr Thr Asn Gin Pro He Met He Ala Leu Asp Lys Ala 
260 265"" " 270 

Leu Thr Pro Asn Gin Ash Ala Gin Arg Tyr Phe Lys Arg Tyr Gin Lys 
275 280 285 

Leu Lys Glu Ala Val Lys Tyr Leu Thr Asp- Leu He Glu Glu Thr Lys 
290 295 300 

Ala Thr He Leu Tyr Leu Glu Ser Val Glu Thr Val Leu Asn Gin Ala 
305 310 315 320 

Gly Leu Glu Glu He Ala Glu lie Arg Glu Glu Leu He Gin Thr Gly 
325 330 335 

Phe He Arg Arg Arg Gin Arg Glu Lys He Gin Lys Arg Lys Lys Leu 
340 345 350 

Glu Gin Tyr Leu Ala Ser Asp Gly Lys Thr He He Tyr Val Gly Arg 
355 360 . 365 

Asn Asn Leu Gin Asn Glu Glu Leu Thr Phe Lys Met Ala Arg Lys Glu 
370 375 380" 

Glu Leu Trp Phe His Ala Lys Asp He Pro Gly Ser His Val Val He 
385 390 395 400 

Ser Gly Asn Leu Asp Pro Ser Asp Ala Val Lys Thr Asp Ala Ala Glu 

405 410 -415 



Leu Ala Ala Tyr Phe Ser Gin Gly Arg Leu Ser Asn Leu Val Gin Val 
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420 425 430 

Asp Met lie Glu Val Lys Lys Leu Asn Lys Pro Thr Gly Gly Lys Pro 
435 - 440 445 

Gly Phe Val Thr Tyr Thr Gly Gin Lys Thr Leu Arg Val Thr Pro Asp 
450 455 460 

Ser Lys Lys lie Ala Ser Met Lys Lys Ser 
465 470 

(2) INFORMATION FOR SEQ ID NO; 61: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Si 



Jlf GACAACATTT ACTATCCATA CAGTAGAGTC AGCACCAGCA GAAGTGAAAG AAATTC TTG A 60 

lU AACAGTAGAA AAAGACAACA ATGGCTATAT TCCCAACCTA ATCGGTCTCT TGGCCAATGC 120 

CCCGACTGTT TTAGAAGCCT AC C AAATTGT CTCATCTATC CACCGTCGCA ACAGCCTGAC 180 

ACCCGTTGAG CGTGAAGTGG TGCAAATCAC GGCAGC CGTG ACCAATGGTT GTGCCTTCTG , 240 

P TGTCGCAGGT CACACAGCCT TTTCCATCAA ACAAATCCAG ATGAATGATG ACTTGATTCA 300 

fU AGCTCTTCGC AATCGTACTC CAATTGAAAC AGATCC TAAA TTGGATACCC TAGCTAAGTT 3 60 

1=3, T AC CTTGGC A GTTATCAATA CCAAGGGTCG TGTAGGAGAT GAAGC CTTGT CTGAGTTTTT 420 

AGAAGCTGGC TACACTCAAC AAAATGC CTT GGATGTGGTT TTTGGTGTCA GCCTAGCAAT 48 0 

CCTCTGTAAC TATGC CAACA ACTTAGCTAA TACACCAATT AATCCAGAAT TGCAACCTTA 540 

TGCC 544 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: . linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Thr Thr Phe Thr lie His Thr Val Glu Ser Ala Pro Ala Glu Val Lys 
1 " 5 10 . 15 

_Glu-Ile_Leu Glu Thr Val Glu Lys Asp Asn Asn Gly Tyr, lie Pro Asn 
20 '.. - 25 30 

Leu lie Gly Leu Leu Ala Asn Ala Pro Thr Val Leu Glu Ala Tyr Gin 
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35 



40 



45 



lie Val Ser Ser lie His Arg Arg Asn Ser Leu Thr Pro Val Glu 

50 - - 55 60 . 



Arg 



Glu Val Val Gin lie Thr Ala Ala Val Thr Asn Gly Cys Ala Phe 

65 70.... 75 



Cys 

80. 



Val Ala Gly His Thr Ala Phe Ser lie Lys Gin lie Gin Met Asn 
85 90 .95 



Asp 



Asp Leu lie Gin Ala Leu Arg Asn Arg Thr Pro lie Glu Thr Asp 
100 105 110 



Pro 



Lys Leu Asp Thr Leu Ala Lys Phe Thr Leu Ala Val lie Asn Thr 
115 120 125 



Lys 



Gly Arg Val Gly Asp Glu Ala Leu Ser Glu Phe Leu Glu Ala Gly 
130 135 140 



Tyr 



"""T~hr t= Gln'"GTn Ash "Ala Leu Asp Val Val Phe Gly Val Ser Leu Ala 
145 150 155 



He 
160 



Leu Cys Asn Tyr Ala Asn Asn Leu Ala Asn Thr Pro He Asn Pro 
165 170 175 



Glu 



Leu Gin Pro Tyr Ala 
180 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GGCTAAGGAA AGAGTGGATG TACTAGCTTA TAAACAGGGG TTGTTTGAAA C G AG AG AGC A , 60 

GGCCAAGCGA GGTGTGATGG CTGGCCTAGT CGTAGCAGTC CTTAATGGAG AAC GGTTTGA 120 

CAAGCCAGGA GAGAAAATTC CAGATGACAC CGAATTAAAA CTCAAGGGGG AGAAACTCAA 18 0 

GTATGTCAGC CGTGGTGGTT TGAAACTGGA AAAGGCCTTG CAGGTCTTTG ATTTGTCGGT 240 

GGATGGCGCG ACTACGATTG ATATC GGGGC CTCTACTGGA GGTTTTACCG ATGTCATGCT 300 

ACAGAATAGT GCCAAGTTGG TCTTTGCAGT CGATGTTGGT ACCAATCAGT TGGCTTGGAA 3 60 

ATTACGCCAA GACCCACGAG TTGTCAGCAT GGAGCAGTTC AATTTCCGCT ATGCTGAAAA 42 0 
GACTGATTTC GAGCAGGAGC CGAGCTTTGC CAGTATTGAT GTGAGTTTCA TTTCCCTTAG - 48 0 

TCTGATTTTG CCAGCCTTGC ACCGTGTCTT GGCTGATCAA GGTCAGGTGG TAGCACTTGT 54 0 

CAAACCTCAG TTTGAGGCAG GACGTGAGCA GATTGGGAAA AATGGAATTA— TTCGAGATGC 600 

TAAGGTTCAT CAGAATGTCC TTGAATCTGT AACAGCTATG GCAGTAGAGG TAGGTTTTTC 660 
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AGTCCTTGGC TTGGACTTTT CTCCCATCCA AGGTGGACAT GGAAATATTG AATTTTTAGC 
GTATTTGAAA AAAGAAAAGT CAGCAAGCAA TCAGATTCTT GCTGAGATTA AAGAAGCAGT 

AG AG AGGGC G CATAGTCAAT TTAAAAATGA A . 

( 2 ) INFORMATION FOR SEQ ID NO:64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS': single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Ala Lys Glu Arg Val Asp Val Leu Ala Tyr Lys Gin Gly Leu Phe Glu 

5 ■ 10 " ----- ,. ,15- - 



Thr Arg Glu Gin Ala Lys Arg Gly Val Met Ala Gly Leu Val Val Ala 
20 25 30 

Val Leu Asn Gly Glu Arg Phe Asp Lys Pro Gly Glu Lys lie Pro Asp 
35 40 45 

Asp Thr Glu Leu Lys Leu Lys Gly Glu Lys Leu Lys Tyr Val Ser Arg 
50 55 ' 60 

Gly Gly Leu Lys Leu Glu Lys Ala Leu Gin Val Phe Asp Leu Ser Val 
65 70 75 80 

Asp Gly Ala Thr Thr He Asp He Gly Ala Ser Thr Gly Gly Phe Thr 
85 90 95 

Asp Val Met Leu Gin. Asn Ser Ala Lys ■ Leu Val Phe Ala Val Asp Val 
100 105 110 

Gly Thr Asn Gin Leu Ala Trp Lys Leu Arg Gin Asp Pro Arg Val Val 
115 120 125 

Ser 'Met Glu Gin Phe Asn Phe Arg Tyr Ala Glu Lys Thr Asp Phe Glu 
130 135 140 

Gin Glu Pro Ser Phe Ala Ser He Asp Val Ser Phe lie Ser Leu Ser 
145 - 150 155 160 

Leu He Leu Pro Ala Leu His Arg Val Leu Ala Asp Gin Gly Gin Val 
165 170 175 

Val Ala Leu Val Lys Pro Gin Phe Glu Ala Gly Arg Glu Gin He Gly 
180 185 190 

Lys Asn Gly He lie Arg Asp Ala Lys Val His Gin Asn Val Leu Glu 
-195 200 205 

Ser Val Thr Ala Met Ala Val Glu Val Gly Phe Ser Val Leu Gly Leu 
210 215 220__ 

Asp Phe Ser Pro He Gin Gly Gly -His Gly Asn He* Glu' Phe Leu Ala 
225 230 235 240 
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Tyr Leu Lys Lys Glu Lys Ser Ala Ser Asn Gin lie Leu Ala Glu lie 
245 250 255 

Lys Glu Ala Val Glu Arg Ala His Ser Gin Phe Lys Asn Glu 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

- q _ TTGTTCCTAT GAACTTGGTC GTCACCAAGC TGGTCAGGTT AAGAAAGAGT CTAATCGAGT 60 

~ TTC TTAT AT A GATGGTGATC AGGCTGGTCA AAAGGCAGAA AACTTGACAC CAGATGAAGT 120 

^ CAGTAAGAGG GAGGGGATCA ACGCCGAACA AATNGTNATC AAGATTACGG ATCAAGGTTA 18 0 

' TGTGACCTCT CATGGAGACC ATTATCATTA CTATAATGGC AAGGTTCCTT ATGATGCCAT 24 0 

y= CATCAGTGAA GAGCTCCTCA TGAAAGATCC GAATTATCAG TTGAAGGATT CAGACATTGT 3 00 

Pq CAATGAAATC AAGGGTGGTT ATGTCATTAA GGTAAACGGT AAATACTATG TNTACC TTAA 3 60 

GGATGCAGCT CATGCGGATA ATATTCGGAC AAAAGAAGAG ATTAAACGTC AGAAGCAGGA 420 

^ ACGCAGTCAT AATCATAACT CAAGAGCAGA TAATGCTGTT GCTGCAGCCA GAGCCCAAGG 480 

H ACGTTATACA ACGGATGATG GGTATATCTT CAATGCATCT GATATCATTG AGGACACGGG 54 0 

TGATGCTTAT ATCGTTCCTC ACGGCGACCA TTACCATTAC ATTCCTAAGA ■ ATGAGTTATC 600 

AGCTAGCGAG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG AAGCAGGGAT CTCGTCCTTC . 660 

TTCAAGTTCT AGTTATAATG CAAATCCAGC . TCAACCAAGA TTGTCAGAGA ACCACAATCT 720 

GACTGTCACT CCAACTTATC ATCAAAATCA AGGGGAAAAC ATTTCAAGCC TTTTAC GTG A 780 

ATTGTATGCT AAACCCTTAT CAGAACGCCA TGTGGAATCT GATGGCCTTA TTTTCGACCC 840 

AGCGCAAATC ACAAGTCGAA CCGCCAGAGG TGTAGCTGTC CCTCATGGTA ACCATTACCA 900 

CTTTATC CCT TATGAACAAA TGTCTGAATT GGAAAAACGA ATTGCTCGTA TTATTC CCCT 960 

TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA GAACAACCAA GTCCACAATC 1020 

GACTCCGGAA CCTAGTCCAA GTCCGCAACC TGCACCAAAT CCTCAACCAG CTCCAAGCAA. 108 0 

TCCAATTGAT GAGAAATTGG TCAAAGAAGC TGTTCGAAAA GTAGGCGATG GTTATGTCTT 1140 

TGAGGAGAAT GGAGTTTCTC GTTATATCCC AGCCAAGGAT CTTTCAGCAG AAACAGCAGC 1200 

^AGGCAOTj^T_AGCAAACTGG CCAAGCAGGA AAGTTTATCT CATAAGCTAG GAGCTAAGAA 1260 

AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG GCTTATGACT TACTAGCAAG 1320 
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TCGACAAGTT GATTTTGAGG CTTTGGATAA " 1380 

TGATAAAGTC AAGTTAGTGG ANGATATTCT 1440 

ACGTTTAGGA AAACCAAATG CGCAAATTAC 1500 

GTTGGCAGGC AAGTACACAA CAGAAGACGG 1560 

TGATGAGGGG GATGCCTATG TAACTCCACA 1620 

TAGTTTGTCT GAAGCTGAGA GAGCGGCAGC 168 0 

CCCTCCTTCG AC AG AC CATC AGGATTCAGG 1740 

CTACAACCGC GTGAAAGCAG CTAAGAAGGT 1800 

ATATAC TGTA GAAGTCAAAA ACGGTAGTTT 1860 
CATCAAATTT GAGTGGTTTG ACGAAGGCCT ' 192 0 

GGATCTTTTG GCGACTGTCA AGTACTATGT 198 0 

TAATGGTTTT GGTAACGCTA GCGACCATGT 2040 

CAATCAAACG GAAAAACCAA GCGAGGAGAA 2100 

CCCTCGAGAA GAGAAACCGC AAAGCGAGAA 2160 

AG AAGAATC A CCAGAGGAAT CAGAAGAACC 2220 

ACTGAGAGAG GCTGAAGATT TACTTGGAAA 2280 

2290 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Cys Ser Tyr Glu Leu Gly Arg His Gin Ala Gly Gin Val Lys Lys Glu 

1 5 ■ ' . 10 15- 

Ser Asn Arg Val "Ser Tyr lie Asp Gly Asp Gin Ala Gly Gin Lys Ala 

..... 20 25 30 

Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly lie Asn Ala 
_ 3 5 40 45 

Glu Gin Xaa Val lie Lys lie Thr Asp Gin Gly Tyr Val Thr Ser His 

50 .'55 60 

Gly Asp His Tyr His Tyr Tyr Asn Gly Lys-Val— Pro— Tyr— Asp- Ala lie 
65 - . -.70 75 - 80 

lie Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gin Leu Lys Asp 



AATTCACCAA GATTTACTTG ATAATAAAGG 
CCTGTTGGAA CGACTCAAGG ATGTCNCAAG 
TGCCTTCTTA GCTCCGATTC GTCATCCAGA 
CTACACTGAT GATGAGATTC AAGTAGCCAA 
TTATATCTTT GATCCTCGTG ATATAAC C AG 
TATGACCCAT AGCCACTGGA TTAAAAAAGA 
CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC 
AAATACTGAG GCAAAAGGAG CAGAAGCTAT 
GCCACTTGAT CGTATGCCTT ACAATCTTCA " 
AATCATACCT CATTATGACC ATTACCATAA 
TTATGAGGCA^CCTAAGGGGT " ATAC TCTTGA 
CGAACATCCA AACGAACGTC CGCATTCAGA 
TCAAAGAAAC AAAAATGGTC AAGC TGAT AC 
ACCTCAGACA GAAAAACCTG AGGAAGAAAC 
AC C AGAGTC T CCAAAACCAA C AG AGGAAC C 
TC AGGTC GAG ACTGAAAAGG TTGAAGAAAA 
AATCCAGGAT 

(2) INFORMATION FOR SEQ ID NO: 66: 
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85 90 95 

Ser Asp lie Val Asn Glu lie Lys Gly Gly Tyr Val lie Lys Val Asn 
100 105 110 ' 

Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp Asn lie 
115 120 125 

Arg Thr Lys Glu Glu lie Lys Arg Gin Lys Gin Glu Arg Ser His Asn 
130 135 140 

His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg Ala Gin "Gly 
145 150 155 160 

Arg Tyr Thr Thr Asp Asp Gly Tyr lie Phe Asn Ala Ser Asp lie lie 
165 170 175 

Glu Asp Thr Gly Asp Ala Tyr lie Val Pro His Gly Asp His Tyr His 
180 185 190 



Tyr lie Pro Lys Asn Glu Leu Ser Ala' Ser Glu Leu Ala Ala" Ala "Glu - 
195 200 205 

Ala Tyr Trp Asn Gly Lys Gin Gly Ser Arg Pro Ser Ser Ser Ser Ser 
210 215 220 

Tyr Asn Ala Asn Pro Ala Gin Pro Arg Leu Ser Glu Asn His Asn Leu 
225 230 235 240 

Thr Val Thr Pro Thr Tyr His Gin Asn Gin Gly Glu Asn lie Ser Ser 
245 250 255 

Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu- Arg His Val Glu 
260 265 . 270 

Ser Asp Gly Leu lie Phe Asp Pro Ala Gin He Thr Ser Arg Thr Ala 
'275 280 285 

Arg Gly Val Ala Val 'Pro His Gly Asn His Tyr His Phe He Pro Tyr 
290 295 300 

Glu Gin Met Ser Glu Leu Glu Lys Arg He Ala Arg He He Pro Leu 
305 310 315 320 

Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gin Pro 
325 330 335 

Ser Pro Gin Ser Thr Pro Glu Pro Ser .Pro Ser Pro Gin Pro Ala Pro 
340 345 350 

Asn Pro Gin Pro Ala Pro Ser Asn Pro He Asp Glu Lys Leu Val Lys 
355 360 365. . 

Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly 
370 375 ~ " 380 

Val Ser Arg Tyr He Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala 
385 390 395 400 

Gly He Asp Ser Lys Leu. Ala Lys Gin' Glu Ser Leu Ser His Lys Leu 
405 410 415 



Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn 



420 



179 
425 



430 



Lys Ala Tyr Asp- Leu Leu Ala Arg lie His Gin Asp Leu Leu Asp Asn 
. 435 440 445 

Lys Gly. Arg Gin Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg 
450 . 455 460 

Leu Lys Asp Val Xaa Ser Asp Lys Val Lys Leu Val Xaa Asp lie Leu 
465 470 475 480 

Ala Phe Leu Ala Pro lie Arg His Pro Glu Arg Leu Gly Lys Pro Asn 
485 490 ' 495 

Ala Gin lie Thr Tyr Thr Asp Asp Glu lie Gin Val Ala Lys Leu Ala 
500 505 510 

Gly Lys Tyr Thr Thr Glu Asp Gly Tyr lie Phe Asp Pro Arg Asp lie 
515 520 525 

Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser 
530 535 540 

His Trp lie Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala 
545 550 555 560 

Gin Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His 
565 570 575 

Gin Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala lie Tyr Asn 
580 585 590 . 

Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn 
595 ■ ' 600 605 

Leu Gin Tyr Thr Val Glu Val Lys Asn Gly Ser Leu lie lie Pro His 
610 615 620 

Tyr Asp His Tyr His Asn lie Lys Phe Glu Trp Phe Asp Glu Gly Leu 
625 630 635 . 640 

Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val 
645 650 655 

Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly 
660 665 . 670 

Phe Gly Asn Ala Ser Asp His Val Gin Arg Asn Lys Asn Gly Gin Ala 
675 680 685 

Asp Thr Asn Gin Thr Glu Lys Pro Ser Glu Glu Lys Pro Gin Thr Glu 
690 695 700 

Lys Pro Glu Glu Glu Thr Pro Arg Glu Glu Lys , Pro Gin Ser Glu Lys 
705 710 715 720 

Pro Glu Ser Pro Lys Pro Thr Glu Glu Pro Glu Glu Ser Pro Glu Glu 
725 ' 730 735 

Ser Glu Glu Pro Gin Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg 

. 740 .... . 745 750 



Glu Ala Glu Asp Leu Leu Gly Lys lie Gin Asp 
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755 760 
(2) INFORMATION FOR SEQ ID NO: 67: 

, (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 base pairs • 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
TTATAAGGGT GAATTAGAAA AAGG AT AC C A ATTTGATGGT TGGGAAATTT CTGGTTTCGA 
AGGTAAAAAA GACGCTGGCT ATGTTATTAA TCTATCAAAA GATACCTTTA TAAAACCTGT 
ATTCAAGAAA ATAGAGGAGA AAAAGG AG G A AGAAAATAAA CCTACTTTTG ATGTATCGAA 
AAAGAAAGAT AACCCACAAG TAAACCATAG TCAATTAAAT GAAAGTCACA GAAAAGAGGA 
TTTACAAAGA GAAGAGCATT CACAAAAATC TGATTCAACT AAGGATGTTA CAGCTACAGT 
TCTTGATAAA AACAATATCA GTAGTAAATC AACTACTAAC AATCCTAATA AG 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 

Tyr Lys Gly Glu Leu Glu Lys Gly Tyr Gin Phe Asp Gly Trp Glu lie 
1 5 .10 15 

Ser Gly Phe Glu Gly Lys Lys Asp Ala Gly Tyr Val lie Asn Leu Ser 
20 25 30 

Lys Asp Thr Phe lie Lys Pro Val Phe Lys Lys lie Glu Glu "Lys Lys 
35 - 40 45 

Glu Glu Glu Asn Lys Pro Thr Phe Asp Val Ser Lys Lys Lys Asp Asn 
50 55 60 

Pro Gin Val Asn His Ser Gin Leu Asn' Glu Ser His Arg Lys Glu Asp 
65 -- 70 75 80 

Leu Gin "Arg Glu Glu His Ser Gin Lys Ser Asp Ser Thr Lys Asp Val 
85 90 95 

Thr Ala Thr Val Leu Asp Lys Asn Asn lie Ser Ser Lys 'Ser Thr Thr 
* — ' 100 " "105 110 



Asn Asn Pro Asn Lys 
115 
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(2) INFORMATION FOR SEQ ID NO : 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1312 base pairs 
. (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GAATGTTCAG GCTCAAGAAA GTTCAGGAAA TAAAATCCAC TTTATCAATG TTCAAGAAGG 
TGGCAGTGAT GCGATTATTC TTGAAAGCAA TGGACATTTT GCCATGGTGG ATACAGGAGA 
AGATTATGAT TTCCCAGATG GAAGTGATTC TCGCTATCCA TGGAGAGAAG GAATTGAAAC 
GTCTTATAAG CATGTTCTAA CAGACCGTGT CTTTCGTCGT TTGAAGGAAT TGGGTGTCCA 
AAAAC TTGAT TTTATTTTGG TGACCCATAC C C ACAGTGAT C ATATTGGAA ~ATGTTGATGA ~ 
ATTACTGTCT ACCTATCCAG TTGACCGAGT CTATCTTAAG AAATATAGTG ATAGTC GT AT 
TACTAATTCT GAACGTC TAT GGGATAATCT GTATGGCTAT GATAAGGTTT TACAGACTGC 
TGCAGAAAAA GGTGTTTCAG TTATTCAAAA TATCACACAA GGGGATGCTC ATTTTCAGTT 
TGGGGACATG GATATTCAGC TCTATAATTA TGAAAATGAA ACTGATTCAT CGGGTGAATT 
AAAGAAAATT TGGGATGACA ATTCCAATTC CTTGATTAGC GTGGTGAAAG TCAATGGCAA 
GAAAATTTAC CTTGGGGGCG ATTTAGATAA TGTTCATGGA GCAGAAGACA AGTATGGTCC 
TCTCATTGGA AAAGTTGATT TGATGAAGTT TAATCATCAC CATGATACCA ACAAATCAAA 
TACCAAGGAT TtCATTAAAA ATTTGAGTCC GAGTTTGATT GTTC AAACTT CGGATAGTCT 
ACCTTGGAAA AATGGTGTTG ATAGTGAGTA TGTTAATTGG CTCAAAGAAC GAGGAATTGA 
GAGAATCAAC GCAGCCAGCA AAGACTATGA TGCAACAGTT TTTGATATTC GAAAAGACGG 
TTTTGTCAAT ATTTCAACAT CCTACAAGCC GATTCCAAGT TTTC AAGCTG GTTGGCATAA 
GAGTGCATAT GGGAACTGGT GGTATCAAGC GCCTGATTCT ACAGGAGAGT ATGCTGTCGG 
TTGGAATGAA ATCGAAGGTG AATGGTATTA CTTTAACCAA ACGGGTATCT TGTTACAGAA 
TCAATGGAAA AAATGGAACA ATCATTGGTT CTATTTGACA GACTCTGGTG CTTCTGCTAA 
AAATTGGAAG AAAATCGCTG GAATCTGGTA TTATTTTAAC AAAGAAAACC AGATGGAAAT 
TGGTTGGATT CAAGATAAAG AGCAGTGGTA TTATTTGGAT GTTGATGGTT CTATGAAGAC 
AGGATGGCTT CAATATATGG GGCAATGGTA TTACTTTGCT CCATCAGGGG AA 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids r 

(B) TYPE: amino acid - : 

<C) STRANDEDNESS : -single . .' 1 ' "■ 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: -— 

Asn Val Gin Ala Gin Glu Ser Ser. Gly Asn Lys lie His Phe lie Asn 
1 5 10 15 

Val Gin Glu Gly Gly Ser Asp Ala lie lie Leu Glu Ser Asn Gly His 
20 25 30 

Phe Ala Met Val Asp Thr Gly Glu Asp Tyr Asp Phe Pro Asp Gly Ser 
35 40 45 

Asp Ser Arg Tyr Pro Trp Arg Glu Gly lie Glu Thr .Ser Tyr Lys His 
50 55 60 

Val Leu Thr Asp Arg Val Phe Arg Arg Leu Lys Glu Leu Gly Val Gin 
65 70 75 80 

Lys Leu Asp Phe lie Leu Val Thr His Thr His Ser Asp His lie Gly 
85 90 95 

Asn Val Asp Glu Leu Leu Ser Thr Tyr Pro Val Asp Arg Val Tyr Leu 
100 105 110 

Lys Lys Tyr Ser Asp Ser Arg lie Thr Asn Ser Glu Arg Leu Trp Asp 
115 120 125 

Asn Leu Tyr Gly Tyr Asp Lys Val Leu Gin Thr Ala Ala Glu Lys Gly 
130 135 140 

Val Ser Val lie Gin Asn lie Thr Gin Gly Asp Ala His Phe Gin Phe 
. 145 - 150 155 160 

Gly Asp Met Asp lie Gin Leu Tyr Asn Tyr Glu Asn Glu Thr Asp Ser 
165 170 175 

Ser Gly Glu Leu Lys Lys lie Trp Asp Asp Asn Ser Asn Ser Leu lie 
180 .185 190 

Ser Val Val Lys Val Asn Gly Lys Lys lie Tyr Leu Gly Gly Asp Leu 
195 200 205 

Asp Asn Val His Gly Ala Glu Asp Lys Tyr Gly Pro Leu lie Gly Lys 
210 215 220 

Val Asp Leu Met Lys Phe Asn His His His Asp Thr Asn Lys Ser Asn 
225 230 235 240 

Thr Lys Asp Phe lie Lys Asn Leu Ser Pro Ser Leu lie Val Gin Thr 
245 250 255 

Ser Asp Ser Leu Pro Trp Lys Asn Gly Val Asp Ser Glu Tyr Val Asn 
260 * 265 270 

Trp Leu Lys Glu Arg Gly He Glu Arg He Asn Ala Ala Ser Lys Asp 
275 280 285 

- Tyr Asp Ala Thr Val Phe Asp Ile__Arg Lys Asp Gly Phe Val Asn He 
290 295 " " 300 
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Ser Thr Ser Tyr Lys Pro lie Pro Ser Phe Gin Ala Gly Trp His Lys' 
- 305- - 310 315 320 

.- Ser- Ala Tyr Gly Asn Trp Trp Tyr Gin Ala Pro Asp Ser Thr Gly Glu 
325 330 335 

Tyr Ala Val Gly Trp Asn Glu lie Glu Gly Glu Trp Tyr Tyr Phe Asn 
340 345 350 

Gin Thr Gly lie Leu Leu Gin Asn Gin Trp Lys Lys Trp Asn Asn His 
355 360 365 

Trp Phe Tyr Leu Thr Asp Ser Gly Ala Ser Ala Lys Asn Trp Lys Lys 
370 375 380 

lie Ala Gly lie Trp Tyr Tyr Phe Asn Lys Glu Asn Gin Met Glu lie 
385 390 395 400 

Gly Trp lie Gin. Asp Lys Glu Gin Trp Tyr Tyr Leu Asp Val Asp Gly 

p .." " 405 * ; 410 415 

C] Ser Met Lys Thr Gly Trp Leu Gin Tyr Met Gly Gin Trp Tyr Tyr Phe 

; ™ . 420 425 430 

'iri 

; ! f s Ala Pro Ser Gly Glu 

¥U 435 

!=& (2) INFORMATION FOR SEQ ID NO: 71: 

s«t ' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: .18 55 base pairs 
l= t (B) TYPE: nucleic acid - 

(C) STRAND EDNESS : double 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CTTGGGTGTA ACCCATATCC AGCTCCTTCC AGTCTTGTCT TACTACTTTG TCAATGAATT 60 

GAAAAACCAT GAACGCTTGT CTGACTACGC TTCAAGCAAC AGCAACTACA ACTGGGGATA 120 

TGACCCTCAA AACTACTTCT CCTTGACTGG TATGTACTCA AGCGATCCTA AGAATCCAGA 180 

AAAACGAATC GCAGAATTTA AAAACCTCAT CAACGAAATC CACAAACGTG GTATGGGAGC 240 

TATCCTAGAT GTCGTTTATA ACCACACAGC' CAAAGTCGAT CTCTTTGAAG ATTTGGAACC 3 00 

AAACTACTAC CACTTTATGG ATGCCGATGG CACACCTCGA ACTAGCTTTG GTGGTGGACG 3 60 

CTTGGGGACA ACCCACCATA TGACCAAACG GCTCCTAATT GACTCTATCA AATACCTAGT 420 

TGATACCTAC AAAGTGGATG GCTTCCGTTT CGATATGATG GGAGACCATG ACGCCGCTTC 48 0 

TATCGAAGAA GCTTACAAGG CTGCACGCGC CCTCAATCCA AACCTCATCA TGCTTGGTGA 540 

AGGTTGGAGA ACCTATGCCG GTGATGAAAA CATGCCTACT AAAGCTGCTG ACCAAGATTG 600 

GATGAAACAT ACCGATACTG TCGCTGTCTT TTCAGATGAC„ATCCGTAACA ACCTCAAATC 660 

TGGTTATCCA AACGAAGGTC AACCTGCCTT TATCACAGGT GGCAAGCGTG ATGTCAACAC 72 0 



y C 

3: 

p i 
1^ 
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CATCTTTAAA AATCTCATTG CTCAACCAAC TAACTTTGAA GCTGACAGCC CTGGAGATGT 78 0 

CATCCAATAC ATCGCAGCCC ATGATAACTT GACCCTCTTT GACATCATTG CCCAGTCTAT 84 0 

CAAAAAAGAC CCAAGCAAGG CTGAGAACTA TGCTGAAATC CACCGTCGTT TACGACTTGG 900 
AAATCTCATG GTCTTGACAG CTCAAGGAAC TCCATTTATC CACTCCGGTC AGGAATATGG _ 960 

ACGTACTAAA CAATTCCGTG ACCCAGCCTA CAAGACTCCA GTAGCAGAGG ATAAGGTTCC 102 0 

AAACAAATCT CACTTGTTGC GTGATAAGGA CGGCAACCCA TTTGACTATC CTTACTTCAT 108 0 
CCATGACTCT TACGATTCTA GTGATGCAGT CAACAAGTTT G AC TGG AC T A AGGCTACAGA • 1140 

TGGTAAAGCT TATCCTGAAA ATGTCAAGAG CCGTGACTAT ATGAAAGGTT TGATTGCCCT 1200 

TCGTCAATCT ACAGATGCCT TCCGACTTAA GAGTCTTCAA GATATCAAAG ACCGTGTCCA 1260 

CCTCATCACT GTCCCAGGCC AAAATGGTGT GGAAAAAGAG GATGTAGTGA TTGGCTACCA 132 0 

AATCACTGCT CCAAACGGCG ATATCTACGC AGTCTTTGTC" AATGCGGATG AAAAAGCTCG 138 0 

CGAATTTAAT TTGGGAACTG CCTTTGCACA TCTAAGAAAT GCGGAAGTTT TGGCAGATGA 1440 

AAACCAAGCA GGACCAGTCG GAATTGCCAA CCCGAAAGGA C TTGAATG G A CTGAAAAAGG 1500 

CTTGAAATTG AATGCCCTTA CAGCTACTGT TCTTCGAGTC TCTCAAAATG GAACTAGCCA 1560 

TGAGTCAACT GCAGAAGAGA AACCAGACTC AACCCCTTCC AAGCCTGAAC ATCAAAATGA 1620 

AGCTTCTCAC CCTGCACATC AAGACCCAGC TCCAGAAGCT AGACCTGATT CTACTAAACC 168 0 

AGATGC C AAA GTAGCTGATG CGGAAAATAA ACCTAGCCAA GCTACAGCTG ATTCACAAGC 1740 

TGAACAACCA GCACAAGAAG CACAAGCATC ATCTGTAAAA GAAGCGGTTC GAAACGAATC 18 0 0 

GGTAGAAAAC TCTAGCAAGG AAAATATACC TGCAACCCCA GATAAACAAG CTGAA 1855 
( 2 )' INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 amino acids 

(B) TYPE: amino acid - • 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 7 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

Leu Gly Val Thr His lie Gin Leu Leu Pro Val Leu Ser Tyr Tyr Phe 

.1 5 10 15 

Val Asn Glu Leu Lys Asn His Glu. Arg Leu Ser Asp Tyr Ala Ser Ser 
20 25 — 30 

Asn Ser Asn Tyr Asn Trp Gly Tyr Asp Pro Gin Asn Tyr Phe Ser Leu 
35 40 45 

Thr Gly Met Tyr Ser Ser Asp Pro Lys Asn Pro_Glu Lys Arg lie Ala " . 

50 . —55. 60 
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Glu Phe Lys- Asn Leu lie Asn Glu lie His Lys" Arg Gly Met Gly Ala 
65 70 - 75 80 

lie Leu Asp Val Val Tyr Asn His Thr Ala Lys Val Asp Leu Phe Glu 
.85 90 95 

Asp Leu Glu Pro Asn Tyr Tyr His Phe Met Asp Ala Asp Gly Thr Pro 
100 105 : T10 

Arg Thr Ser Phe Gly Gly Gly Arg Leu Gly Thr Thr His His Met Thr 
115 120 125 

Lys Arg Leu Leu "lie Asp Ser lie Lys Tyr Leu Val Asp Thr Tyr Lys 
130 135 140 ^ 

Val Asp Gly Phe Arg Phe Asp Met Met Gly Asp His Asp Ala Ala Ser 
145 150 155 160 

lie Glu Glu Ala Tyr Lys Ala Ala Arg Ala Leu Asn Pro Asn Leu lie 

165 - - 170 175 

Met Leu Gly Glu Gly Trp Arg Thr Tyr Ala Gly Asp Glu Asn Met Pro 
180 185 190 

Thr Lys Ala Ala Asp Gin Asp Trp Met Lys His Thr Asp Thr Val Ala 
195 200 205 

Val Phe Ser Asp Asp lie Arg Asn Asn Leu Lys Ser Gly Tyr Pro Asn 
210 215 220 

Glu Gly Gin Pro Ala Phe He Thr Gly Gly Lys Arg Asp Val Asn Thr 
225 230 235 240 

He Phe Lys Asn Leu He Ala Gin Pro Thr Asn Phe Glu Ala Asp Ser 
245 250 255 

Pro Gly Asp Val He Gin Tyr He Ala Ala His Asp Asn Leu Thr Leu 
260 265 270 

Phe Asp He He Ala Gin Ser He Lys Lys Asp Pro Ser Lys Ala Glu 
275 280 ' 285 

Asn Tyr Ala Glu He His Arg Arg Leu Arg Leu Gly Asn Leu Met Val 
290 295 300 

Leu Thr Ala Gin Gly Thr Pro Phe He His Ser Gly Gin Glu Tyr Gly 
305 310 " 315 320 

Arg Thr Lys Gin Phe Arg Asp Pro Ala Tyr Lys Thr Pro Val Ala Glu 
325 330 335 

Asp Lys Val Pro Asn Lys Ser His Leu Leu Arg Asp Lys Asp Gly Asn 
340 345 350 

Pro Phe Asp Tyr Pro Tyr Phe He His Asp Ser Tyr Asp Ser Ser Asp 
355 360 365 

Ala Val Asn Lys Phe Asp Trp Thr Lys Ala Thr Asp Gly Lys Ala Tyr 
370 375 380 

Pro Glu Asn Val Lys Ser Arg Asp Tyr Met Lys Gly Leu He Ala. Leu_ 
385 390 3.9.5 .... " 400 
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Arg Gin Ser Thr Asp Ala Phe Arg Leu Lys Ser Leu Gin Asp lie Lys 
405 410 415 

_ Asp. Arg Val His Leu lie Thr Val Pro Gly Gin Asn Gly Val Glu Lys 
420 425 . 430 

Glu Asp Val Val lie Gly Tyr Gin lie Thr Ala Pro Asn Gly Asp lie 
435 440 445 

Tyr Ala Val Phe Val Asn Ala Asp Glu Lys Ala Arg Glu Phe Asn Leu 
450 455 460 

Gly Thr Ala Phe Ala His Leu Arg Asn Ala Glu Val Leu Ala Asp Glu 

465 470 . ... 475 480 

Asn Gin Ala Gly Pro Val Gly lie Ala Asn Pro Lys Gly Leu Glu Trp 
485 490 495 

Thr Glu Lys Gly Leu Lys Leu Asn Ala Leu Thr Ala Thr Val Leu Arg 
— 500- - - - 505 510 



Val Ser Gin Asn Gly Thr Ser His Glu Ser Thr Ala Glu Glu Lys Pro 
515 520 525 

Asp Ser Thr Pro Ser Lys Pro Glu His Gin Asn Glu Ala Ser His Pro 
530 535 540 

Ala His Gin Asp Pro Ala Pro Glu Ala Arg Pro Asp Ser Thr Lys Pro 
545 550 555 560 

Asp Ala Lys Val Ala Asp Ala Glu Asn Lys Pro Ser Gin Ala Thr Ala 
565 570 575 

Asp Ser Gin Ala Glu Gin Pro Ala Gin Glu Ala Gin Ala Ser Ser Val 
580 585 590 

Lys Glu Ala Val Arg Asn Glu Ser Val Glu Asn Ser Ser Lys Glu Asn 
595 600 605 

lie Pro Ala Thr Pro Asp Lys Gin Ala Glu 
610 615 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1774 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

TAGTGATGGT ACTTGGCAAG GAAAACAGTA TCTGAAAGAA GATGGCAGTC AAGCAGCAAA 
TGAGTGGGTT TTNGATACTC ATTATCAATC TTGGTTCTAT ATAAAAGCAG ATGCTAACTA 
TGC TGAAAAT GAATGGCTAA AGCAAGGTGA CGACTATTTT TACCTCAAAT CTGGTGGCTA 



"TATGGCCAAA TCAGAATGGG TAGAAGACAA GGGAGCCTTT TATTATCTTG ACCAAGATGG 
AAAGATGAAA AGAAATGCTT GGGTAGGAAC TTCCTATGTT GGTGCAACAG GTGCCAAAGT 
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AATAGAAGAC TGGGTCTATG ATTCTCAATA CGATGCTTGG TTTTATATCA AAGCAGATGG 
AC AGC AC GC A GAGAAAGAAT GGCTCCAAAT TAAAGGGAAG GACTATTATT TCAAATCCGG - 
TGGTTATCTA CTGACAAGTC AGTGGATTAA TCAAGCTTAT GTGAATGCTA GTGGTGCCAA 
AGTACAGCAA GGTTGGCTTT TTGACAAACA ATACCAATCT TGGTTTTACA TCAAAGAAAA 
TGGAAACTAT GCTGATAAAG AATGGATTTT CGAGAATGGT CACTATTATT ATCTAAAATC 
CGGTGGCTAC ATGGCAGCCA ATGAATGGAT TTGGGATAAG GAATCTTGGT TTTATCTCAA 
ATTTGATGGG AAAATGGCTG AAAAAGAATG GGTCTACGAT TCTCATAGTC AAGCTTGGTA 
CTACTTCAAA TCCGGTGGTT AC ATGACAGC CAATGAATGG ATTTGGGATA AGGAATCTTG 
GTTTTACCTC AAATCTGATG GGAAAATAGC TGAAAAAGAA TGGGTCTACG ATTCTCATAG 
TCAAGCTTGG TACTACTTCA AATC TGGTGG CTACATGGCG AAAAATGAGA CAGTAGATGG 
TTATCAGCTT GGAAGCGATG GTAAATGGCT TGGAGGAAAA AC TACAAATG AAAATGCTGC 
TTACTATCAA GTAGTGCCTG TTACAGCCAA TGTTTATGAT TCAGATGGTG AAAAGCTTTC 
CTATATATCG CAAGGTAGTG TCGTATGGCT AGATAAGGAT AGAAAAAGTG ATGACAAGCG 
CTTGGCTATT ACTATTTCTG GTTTGTCAGG C TAT AT G AAA ACAGAAGATT TACAAGCGCT 
AGATGCTAGT AAGGACTTTA TCCCTTATTA TGAGAGTGAT GGCCACCGTT TTTATCACTA 
TGTGGCTCAG AATGCTAGTA TCCCAGTAGC TTCTCATCTT TCTGATATGG AAGTAGGCAA 
GAAATATTAT TCGGCAGATG GCCTGCATTT TGATGGTTTT AAGCTTGAGA ATCCCTTCCT 
TTTCAAAGAT TTAACAGAGG CTACAAACTA CAGTGCTGAA GAATT GGAT A AGGTATTTAG 
TTTGCTAAAC ATTAACAATA GCCTTTTGGA GAACAAGGGC GCTACTTTTA AGGAAGCCGA 
AGAACATTAC CATATCAATG CTCTTTATCT CCTTGCCCAT AGTGCCCTAG AAAGTAACTG 
GGGAAGAAGT AAAATTGCCA AAGATAAGAA TAATTTCTTT GGC ATT AC AG CCTATGATAC 
GACCCCTTAC CTTTCTGCTA AGACATTTGA TGATGTGGAT AAGGGAATTT TAGGTGCAAC 
CAAGTGGATT AAGGAAAATT ATATCGATAG GGGAAGAACT TTCCTTGGAA ACAAGGCTTC 
TGGTATGAAT GTGGAATATG CTTCAGACCC TTATTGGGGC GAAAAAATTG C TAGTGTGAT 
GATGAAAATC AATGAGAAGC TAGGTGGCAA AGAT 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single. 
. (D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein _ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
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Ser Asp Gly Thr Trp Gin Gly Lys Gin Tyr Leu Lys Glu Asp Gly Ser 
15 10 15 



Gin Ala Ala Asn Glu Trp Val Xaa Asp Thr His Tyr Gin Ser Trp Phe 
20 25 30 

Tyr lie Lys Ala Asp Ala Asn Tyr Ala Glu Asn Glu' Trp Leu Lys Gin 
35' 40 45 

Gly Asp Asp Tyr Phe Tyr Leu Lys Ser Gly Gly Tyr Met Ala Lys Ser 
50 55 .60 

Glu Trp Val Glu Asp Lys Gly Ala Phe Tyr Tyr Leu Asp Gin Asp Gly 
65 , 70 75 80 

Lys Met Lys Arg Asn Ala Trp Val Gly Thr Ser Tyr Val Gly Ala Thr 
85 90 95 

Gly Ala Lys Val lie Glu Asp Trp Val Tyr Asp Ser Gin Tyr Asp Ala 
100 - 105 110 

Trp Phe Tyr lie Lys ,Ala Asp Gly Gin His Ala Glu Lys Glu Trp Leu 
115 120 125- 

Gin lie Lys Gly Lys Asp Tyr Tyr Phe Lys Ser Gly Gly Tyr Leu Leu 
130 135 140 

Thr Ser Gin Trp lie Asn Gin Ala Tyr Val Asn Ala Ser Gly Ala Lys 
145 150 155 160 

Val Gin Gin Gly Trp Leu Phe Asp Lys Gin Tyr Gin Ser Trp Phe Tyr 
165 - 170 175 

lie Lys Glu Asn Gly Asn Tyr Ala Asp Lys Glu Trp lie Phe Glu Asn 
180 185 190 

Gly His Tyr Tyr Tyr Leu Lys Ser Gly Gly Tyr* Met Ala Ala Asn Glu 
195 200 205 . 

Trp lie Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Phe Asp Gly Lys 
210 215 220 

Met Ala Glu Lys Glu Trp Val Tyr Asp Ser His Ser Gin Ala Trp Tyr 
225 ' 230 235 240 

Tyr Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp lie Trp Asp 
245 . 250 255 

Lys Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys lie Ala Glu Lys 
260 265 270 

Glu Trp Val Tyr Asp Ser His Ser Gin Ala Trp Tyr Tyr Phe Lys Ser 
275 280 285 

Gly Gly Tyr Met Ala Lys Asn Glu Thr Val Asp Gly Tyr Gin Leu Gly 
290' 295 300 



Ser Asp Gly Lys Trp Leu Gly Gly Lys Thr Thr Asn Glu Asn Ala Ala 
305 " 310 315 320 

Tyr Tyr Gin Val Val Pro' Val Thr Ala Asn Val Tyr Asp Ser Asp Glyi 
325 330 335 
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Glu Lys Leu Ser Tyr lie Ser Gin Gly Ser Val Val Trp Leu Asp Lys 
340 345 4 350 

Asp Arg Lys Ser Asp Asp Lys Arg Leu Ala lie Thr lie Ser Gly Leu 
355 360 365 

Ser Gly Tyr Met Lys Thr Glu Asp Leu Gin Ala Leu Asp Ala Ser Lys 
370 375 380 

Asp Phe lie Pro Tyr Tyr Glu Ser Asp Gly His Arg Phe Tyr His Tyr 
385 390 395 400 

Val Ala Gin Asn Ala Ser lie Pro Val Ala Ser His Leu Ser Asp Met 
405 410 415 

Glu Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gly Leu His Phe Asp Gly 
420 425 430 

Phe" Lys "Leu Glu Asn Pro Phe Leu Phe Lys Asp Leu Thr Glu Ala' Thr 
- 435 440 445 

Asn Tyr Ser Ala Glu Glu Leu Asp Lys Val Phe Ser Leu Leu Asn lie 
450 455 460 

Asn Asn Ser Leu Leu Glu Asn Lys Gly Ala Thr Phe Lys Glu Ala Glu 
465 - 470 475 480 

Glu His Tyr His lie Asn Ala Leu Tyr Leu Leu Ala His Ser Ala Leu 
485 490 495 

Glu Ser Asn Trp Gly Arg Ser Lys lie Ala Lys Asp Lys Asn Asn Phe 
500 505 510 

Phe Gly lie Thr Ala Tyr Asp Thr Thr Pro Tyr Leu Ser Ala Lys Thr 
515 520 525 

Phe Asp Asp Val Asp Lys Gly He Leu Gly Ala Thr Lys Trp He Lys 
530 535 540 

Glu Asn Tyr He Asp Arg Gly Arg Thr Phe Leu Gly Asn Lys Ala Ser 
545 550 555 560 

Gly Met Asn Val Glu Tyr Ala Ser Asp Pro Tyr Trp Gly Glu Lys He 
565 570 575 

Ala Ser Val Met Met Lys He Asn Glu Lys Leu Gly Gly Lys Asp 
580 585 590 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1105 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRI-P-TION-:— SEQ— ID-NO : 75: 
TGGGATTCAA TATGTCAGAG ATGATACTAG AGATAAAGAA GAGGGAATAG AGTATGATGA 



CGCTGACAAT GGGGATATTA TTGTAAAAGT AGCGACTAAA CCTAAGGTAG TAACCAAGAA 
AATTTCAAGT ACGCGAATTC GTTATGAAAA AGATGAAACA AAAGAC CGTA GTGAAAATCC 
TGTTACAATT GATGGAGAGG ATGGCTATGT AACTACGACA AGGACCTACG ATGTTAATCC 
AGAGACTGGT TATGTTACCG AACAGGTTAC TGTTGATAGA AAAGAAGC C A CGGATACAGT 
TATCAAAGTT CCAGCTAAAA GCAAGGTTGA AGAAGTTCTT GTTCCATTTG CTACTAAATA 
TGAAGCAGAC AATGACCTTT CTGCAGGACA GGAGCAAGAG ATTACTCTAG GAAAGAATGG 
GAAAACAGTT ACAACGATAA CTTATAATGT AGATGGAAAG AGTGGACAAG TAACTGAGAG 
T AC TTTAAGT CAAAAAAAAG ACTCtCAAAC AAGAGTTGTT AAAAAAAGaA CCArkCCCCA 
AGTTCTTGTC "CAAGAAATTC CAATCGAAAC AGAATATCTC GATGGCCCaA CTCTTGATAA 
AaGTCAAGAA GTAGAAGAAG ' TAGGAGAAAT TGGTAAATTA CTCTTACTAC AATCTATACT 
GGTAGATGAA C GTGATGGAA~ C AATTG AAGA- AAGT AC TTCT- - C GTC AAATT A CTAAAGAGAT 
GGTAAAAAGA CGTATAAGGA GAGGGACGAG AGAACCTGAA AAAGTTGTTG TTCCTGAGCA 
ATCATCTATT CCTTCGTATC CTGTATCTGT TACATCTAAC CAAGGAACAG ATGTAGCAGT 
AGAAC CAGCT AAAGCAGTTG CTCCAACAAC AGACTGGAAA CAAGAAAATG GTATGTGGTA 
TTTTTATAAT ACTGATGGTT CCATGGCAAC AGGTTGGGTA CAAGTTAATA GTTCATGGTA 
CTACCTCAAC AGCAACGGTT CTATGAAAGT CAATCAATGG TTCCAAGTTG GTGGTAAATG 
GTATTATGTA AATACATCGG GTGAGTTAGC GGTCAATACA AGTATAGATG GCTATAGAGT 
CAATGATAAT GGTGAATGGG TGCGT 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 68 amino acids 

(B) TYPE: amino ac'id 

(C) STRANDEDNESS :' single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly lie Gin Tyr Val Arg Asp Asp Thr Arg Asp Lys Glu Glu Gly lie 
1 5 10 15 

Glu Tyr Asp Asp Ala Asp Asn Gly -Asp lie lie Val Lys Val Ala Thr 
20 25 30 

Lys Pro Lys Val Val Thr Lys Lys lie Ser Ser Thr Arg lie Arg Tyr 
35 40 " _ 45 

Glu Lys Asp Glu Thr Lys Asp Arg Ser "Glu Asn Pro Val Thr lie Asp 

50 55 ~ 60 

jGly Glu Asp Gly Tyr Val Thr Thr Thr Arg Thr Tyr Asp Val Asn Pro 
65 70 - - 75 80 
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Glu Thr Gly Tyr Val Thr Glu Gin Val Thr Val Asp Arg Lys Glu Ala 
' ,85 90 95 

Thr Asp Thr Val lie Lys -Val Pro Ala Lys Ser Lys Val Glu Glu Val 
100 105 110 

Leu Val Pro Phe Ala Thr Lys Tyr Glu Ala Asp Asn Asp Leu Ser Ala 
115 120 125 

Gly Gin Glu Gin Glu lie Thr Leu Gly Lys Asn- Gly Lys Thr Val Thr 
130 135 140 

Thr lie Thr Tyr Asn Val Asp Gly Lys Ser Gly Gin Val Thr Glu Ser 
145 150 155 160 

Thr Leu Ser Gin Lys Lys Asp Ser Gin Thr Arg Val Val Lys Lys Arg 
165 170 175 

^ . ■ Thr Xaa Pro Gin Val Leu Val Gin Glu lie Pro lie Glu Thr Glu Tyr 

Q - - 180 185 190 - 

%j .Leu Asp Gly Pro Thr Leu Asp Lys Ser Gin Glu Val Glu Glu Val' Gly 

01 " 195 200 205 

3=j Glu lie Gly Lys Leu Leu Leu Leu Gin Ser lie Leu Val Asp Glu Arg 

^ 210 215 . 220 

3 Asp Gly Thr lie Glu Glu Thr Thr Ser Arg Gin lie Thr Lys Glu Met 

^ 225 230 235 240 

M: Val Lys Arg Arg lie Arg Arg Gly Thr Arg Glu Pro Glu Lys Val Val 

1U 245 250 255 



Val Pro Glu Gin Ser Ser He Pro Ser Tyr Pro Val Ser Val Thr Ser 
260 265 270 

Asn Gin Gly Thr Asp Val Ala Val Glu Pro Ala Lys Ala Val Ala Pro 
275 280 v 285 - 

Thr Thr Asp Trp Lys Gin Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr 
290 295 300 

Asp Gly Ser Met Ala Thr Gly Trp Val Gin .Val Asn Ser Ser Trp Tyr 
305 310 , 315 320 

Tyr Leu Asn Ser Asn Gly Ser Met Lys Val Asn Gin Trp Phe Gin Val 
325 330 335 

Gly Gly Lys Trp Tyr Tyr Val Asn Thr Ser Gly Glu Leu Ala Val Asn 
340 345 350 

Thr Ser lie Asp Gly Tyr Arg Val Asn Asp Asn Gly Glu Trp Val Arg 
355 360 365 . 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 
.JC) STRANDEDNESS : double 

(D)_ TOPOLOGY: linear . 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GGATAATAGA GAAGCATTAA AAACCTTTAT GACGGGTGAA AATTTTTATC TCCAACATTA 
. TCTAGGAGCA CATAGGGAAG AACTAAATGG - AGAGCATGGC TATACCTTCC GTGTTTGGGC 
ACCTAATGCT CAGGCTGTTC ACTTGGTTGG TGATTTTACC AACTGGATTG AAAATCAGAT 
TCCAATGGTA AGAAATGATT TTGGGGTCTG GGAAGTCTTT ACCAATATGG CTCAAGAAGG 
GCATATTTAC AAATATCATG TC AC AC GTC A AAATGGTCAT CAACTGATGA AGATTGACCC 
TTTTGCTGTC AGGTATGAGG CTCGTCCAGG AACAGGGGCA ATC GTAAC AG AGCTTCCTGA 
GAAGAAATGG AAGGATGGAC TTTGGCTGGC AC GAAG AAAA CGTTGGGGCT TTGAAGAGCG 
TCCTGTCAAT ATTTATGAAG TTCACGCTGG ATCATGGAAA AG AAATTC TG ATGGCAGTCC 
TTATAGTTTT GCCCAGCTCA AGGATGAACT CATTCCTTAT CTCGTTGAAA TGAACTATAC 



TCATATTGAG TTTATGCCCT TGATGTCCCA TC CTTTGGGC TTGAGTTGGG GGTATCAGCT 
TAT GGGTT AC TTCGCTTTAG AGCATGCTTA TGGCCGACCA GAGGAGTTTC AAGATTTTGT 
C 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Asp Asn Arg Glu Ala Leu Lys Thr Phe Met Thr Gly Glu Asn Phe Tyr 
1 5 10 15 

Leu Gin His Tyr Leu Gly Ala His Arg Glu Glu Leu Asn Gly Glu His 

20 25 30 ' . 

Gly Tyr Thr Phe - Arg Val Trp Ala Pro Asn Ala Gin Ala Val His Leu 
35 40 45 

Val Gly Asp Phe Thr Asn Trp lie Glu Asn Gin' lie Pro Met Val Arg 
50 55 60 

Asn Asp Phe Gly Val Trp Glu Val Phe Thr Asn Met Ala Gin Glu Gly 
65 70 75 80 

His lie Tyr Lys Tyr His Val Thr. Arg Gin Asn Gly His Gin Leu Met 
.85 " 90 95 

Lys lie Asp Pro_Phe_Ala J£al_Ar.g_Tyr Glu Ala Arg Pro Gly Thr Gly 

100 105 - - HO 

Ala lie Val Thr Glu Leu Pro Glu Lys Lys Trp Lys Asp Gly Leu Trp 
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115 120 125 

Leu Ala Arg Arg Lys Arg Trp Gly Phe Glu Glu Arg Pro Val Asn lie 

130 135 . . . _J 140 

Tyr Glu Val His Ala Gly Ser Trp Lys Arg Asn Ser Asp Gly Ser Pro 
145 - - 150 155 _ 160 

Tyr Ser Phe Ala Gin Leu Lys Asp Glu Leu lie Pro Tyr Leu Val Glu 
165 .170 175 

Met Asn Tyr Thr His lie Glu Phe Met Pro Leu Met Ser His Pro Leu 
180 185 190 

Gly Leu Ser Trp Gly Tyr Gin Leu Met Gly Tyr Phe Ala Leu Glu His 
195 200 205 

Ala Tyr Gly Arg Pro Glu Glu Phe Gin Asp Phe Val 
210 215 220 

INFORMATION FOR SEQ ■ ID" NO':™ 7 9 : - — 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

AGATTTTGTC GAGGAGTGTC ATACCCATAA TATTGGGGTT ATTGTGGACT GGGTACCAGN 60 

TCACTTTACC ATCAACGATG ATGCCTTAGC CTATTATGAT GGGACACCGA CTTTTGAATA 12 0 

CCAAGACCAT AATAAGGCTC ATAACCATGG TTGGGGTGCC CTTAATTTTG AC CTTGGAAA 18 0 

AAATGAAGTC CAGTCCTTCT TAATTTCTTG CATTAAGCAT TGGATTGATG TCTATCATTT 240 

GGATGGTATT CGTGTGGATG CTGTTAGCAA C ATGCTC TAT TTGG AC TATG ATGATGCTCC 300 

ATGGACACCT AATAAAGATG GCGGAAATCT CAACTATGAA GGTTATTATT TCCTTCAGCG 3 60 

CTTGAATGAG GTTATTAAGT TAGAATATCC AGATGTGATG ATGATTGCAG AAGAAAGTTC 42 0 

GTCTGCGATC AAGATTACGG GAATGAAAGA GATTGGTGGT CTAGGATTTG ACTACAAATG 480 

GAACATGGGC TGGATGAATG ATATCCTCCG TTTCTACGAA GAAGATCCGA TCTATC GTAA 540 

ATATGACTTT AACCTGGTGA CTTTCAGCTT TATGTATGTT TNCAAGGAGA ATTATC TCTT 600 

GCCATTCTCG CACGATGAAG TGGTTCATGG CAAGAAGAGT ATGATGCATA AGATGTGGGG 660 

AGATCGTTAC AATCAATTCG CAGGCTTGCG CAATCTCTAT ACGTACCAAA TTTGTCACCC 720 

TGGTAAGAAA TTGC TCTTC A TGGGTAGCGA ATACGGTCAA TTCC TAGAAT GGAAATCTGA 78 0 

AGAACAGTTG GAATGGTCTA ACCTAGAAGA CCCAATGAAT. GCTAAGATGA AGTATTTCGC 840 

TTCTCAGCTA AACCAGTTTT ACAAAGATCA„TCGCTGTCTG TGGGAAATTTG ATACCAGCTA 900 1 

TGATGGTATT GAAATCATTG ATGCGGATAA TCGAGAC CAG AGTGTTC TTT CCTTTATTCG 960 
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TAAGGGTAAA AAGGGA 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



Asp Phe Val Glu Glu Cys His Thr His Asn lie Gly Val lie Val Asp 
1 5 .10 .15 

Trp Val Pro Xaa His Phe Thr lie Asn Asp Asp Ala Leu Ala Tyr Tyr 
20 25 30 

Asp Gly Thr Pro Thr Phe Glu Tyr Gin Asp His Asn Lys Ala His Asn 
35 ~ 40 45 

His Gly Trp Gly Ala Leu Asn Phe Asp Leu Gly Lys Asn Glu Val Gin 
50 55 60 

Ser Phe Leu lie Ser Cys lie Lys His Trp lie Asp Val Tyr His Leu 
65 70 75 80 

Asp Gly lie Arg Val Asp Ala Val Ser Asn Met Leu Tyr Leu Asp Tyr 
85 90 95 

Asp Asp Ala Pro Trp Thr Pro Asn Lys Asp Gly Gly Asn Leu Asn Tyr 
100 105 110 

Glu Gly Tyr Tyr Phe Leu Gin Arg Leu Asn Glu Val lie Lys Leu Glu 
115 120 125 

Tyr Pro Asp Val Met Met lie Ala Glu Glu Ser Ser Ser Ala lie Lys 
130 135 140 

lie Thr Gly Met Lys Glu lie Gly Gly Leu Gly Phe Asp Tyr Lys Trp 
145 150 155 160 

Asn Met Gly Trp Met Asn Asp He Leu Arg Phe Tyr Glu Glu Asp Pro 
165 ' 170 175 

He Tyr Arg Lys Tyr Asp Phe Asn Leu Val Thr Phe Ser Phe Met Tyr 
180 185 . 190 

Val Xaa Lys Glu Asn Tyr Leu Leu Pro Phe Ser His Asp Glu Val Val 
195 200 205 

His Gly Lys Lys Ser Met Met His Lys Met Trp Gly Asp Arg Tyr Asn 
210 215 220 

Gin Phe Ala Gly Leu Arg Asn Leu Tyr Thr Tyr Gin He Cys His Pro 
225 230 235 240 

Gly Lys Lys Leu LeuT'Phe Met Gly Ser Glu Tyr Gly Gin Phe Leu- Glu 
245 250 255 
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Trp Lys Ser Glu Glu- Gin Leu Glu Trp Ser Asn Leu Glu Asp Pro" Met 
260 - 265 270 

Asn Ala Lys Met Lys Tyr Phe Ala Ser Gin Leu Asn Gin Phe Tyr Lys 
275 280 285 

Asp His ' Arg Cys Leu Trp Glu lie Asp Thr Ser Tyr Asp Gly lie Glu 
290 295 300 

lie lie Asp Ala Asp Asn Arg Asp Gin Ser Val Leu Ser Phe lie Arg 
305 310 315 320 

Lys Gly Lys Lys Gly 
325 

(2) INFORMATION FOR SEQ ID NO : 81: 

(i) SEQUENCE CHARACTERISTICS: 

~(A) LENGTH: 2134 base pairs 

— (B)~ TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 81: 

ATCTGTAGTT T ATGC GGATG AAACACTTAT TACTCATACT GCTGAGAAAC CTAAAGAGGA 60 

AAAAATGATA GTAGAAGAAA AGGCTGATAA AGCTTTGGAA ACTAAAAATA TAGTTGAAAG 12 0 

GACAGAACAA AGTGAAC CTA GTTCAACTGA GGCTATTGCA TCTGAGNAGA AAGAAGATGA 180 

AGCCGTAACT CCAAAAGAGG AAAAAGTGTC TGCTAAACCG GAAGAAAAAG CTCCAAGGAT 240 

AGAATCACAA GCTTCAAATC AAGAAAAACC GCTCAAGGAA GATGCTAAAG C TGTAAC AAA 3 00 

TGAAGAAGTG AATCAAATGA TTGAAGACAG GAAAGTGGAT TTTAATCAAA ATTGGTACTT 3 60 

TAAACTCAAT GCAAATTCTA AGGAAGCCAT TAAACCTGAT GCAGACGTAT CTACGTGGAA 420 

AAAATTAGAT TTACCGTATG ACTGGAGTAT CTTTAACGAT TTCGATCATG AATCTCCTGC 480 

ACAAAATGAA GGTGGACAGC TCAACGGTGG GGAAGCTTGG TATCGCAAGA CTTTCAAACT 54 0 

AGATGAAAAA GACCTCAAGA AAAATGTTCG CCTTACTTTT GATGGCGTCT ACATGGATTC 600 

TCAAGTTTAT GTCAATGGTC AGTTAGTGGG GCATTATCCA AATG GTT AT A AC C AGTTCTC 660 

ATATGATATC ACCAAATACC TTCAAAAAGA TGGTCGTGAG AATGTGATTG CTGTCCATGC 72 0 

AGTCAACAAA CAGCCAAGTA GCCGTTGGTA TTCAGGAAGT GGTATCTATC GTGATGTGAC 78 0 

TTTACAAGTG ACAGATAAGG TGCATGTTGA GAAAAATGGG ACAACTATTT T AAC AC C AAA 840 

ACTTGAAGAA CAACAACATG GCAAGGTTGA AACTCATGTG ACCAGCAAAA TCGTCAATAC 900 

GGACGACAAA GACCATGAAC TTGTAGCCGA ATATCAAATC GTTGAACGAG GTGGTCATGC 960 

TGTAAC AGGC TTAGTTCGTA CAGCGAGTCG TACCTTAAAA GCACATGAAT CAACAAGCCT 102 0 

AG ATGC G ATT TTAGAAGTTG AAAGACCAAA ACTCTGGACT GTTTTAAATG AC AAAC CTGC 108 0 
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CTTGTACGAA TTGATTACGC GTGTTTACCG TGACGGTCAA TTGGTTGATG CTAAGAAGGA 

TTTGTTTGGT TACCGTTACT ATCACTGGAC TCCAAATGAA GGTTTCTCTT TGAATGGTGA 

ACGTATTAAA TTCCATGGAG TATC CTTGC A CCACGACCAT GGGGCGC TTG GAGCAGAAGA 

AAACTATAAA GCAGAATATC GCCGTCTCAA ACAAATGAAG GAGATGGGAG TTAACTCCAT 

CCGTACAACC CACAACCCTG CTAGTGAGCA AACCTTGCAA ATCGCAGCAG AACTAGGTTT 

ACTCGTTCAG GAAGAGGCCT TTGATACGTG GTATGGTGGC AAGAAACCTT ATGACTATGG 

ACGTTTCTTT GAAAAAGATG CCACTCACCC AGAAGCTCGA AAAGGTGAAA *AATGGTCTGA 

TTTTG AC CT A CGTACCATGG TCGAAAGAGG CAAAAACAAC CCTGCTATCT TCATGTGGTC 
AATTGGTAAT GAAATAGGTG AAGCTAATGG TGATGCCCAC . TCTTTAGCAA CTGTTAAACG 

TTTGGTTAAG GTTATCAAGG ATGTTGATAA GACTCGCTAT . .GTTACCATGG GA GCAGATAA 

ATTCCGTTTC GGTAATGGTA GCGGAGGGCA TGAGAAAATT GCTGATGAAC TCGATGCTGT 

TGGATTTAAC TATTCTGAAG ATAATTACAA AGCCCTTAGA GCTAAGCATC CAAAATGGTT 

GATTTATGGA TCAGAAACAT CTTCAGCTAC CCGTACACGT GGAAGTTACT ATCGCCCTGA 

ACGTGAATTG AAACATAGCA ATGGACCTGA GCGTAATTAT GAACAGTCAG ATTATGGAAA 

TGATCGTGTG GGTTGGGGGA AAACAGCAAC CGCTTCATGG ACTTTTGACC GTGAC AACGC 

TGGCTATGCT GGACAGTTTA TCTGGACAGG TACGGACTAT ATTGGTGAAC CTACACCATG 
GCACAACCAA AATCAAACTC CTGTTAAGAG CTCTTACTTT . GGTATCGTAG ATACAGCCGG 
CATTCCAAAA C ATGAC TTCT ATCTCTACCA AAGC 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 711 amino acids. 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

Ser Val Val Tyr Ala Asp Glu Thr Leu lie Thr His Thr Ala Glu Lys 

1 5 10 ' - . _ 15 

Pro Lys Glu Glu Lys Met lie Val Glu Glu Lys Ala Asp Lys Ala Leu 

20 .25 _ ' 30 

Glu Thr Lys Asn lie Val Glu Arg Thr Glu Gin Ser Glu Pro Ser Ser 

35 .40 45 

Thr Glu Ala lie Ala Ser Glu Xaa Lys Glu Asp Glu Ala Val Thr Pro 

50 " ■ .55 60" ' " 



Lys. Glu Glu Lys Val Ser Ala Lys Pro Glu Glu Lys Ala Pro Arg lie 
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65 70 . 75 80 

Glu Ser Gin Ala Ser Asn Gin Glu Lys Pro Leu Lys Glu Asp Ala Lys 
85 90" 95 

Ala Val Thr Asn Glu Glu Val Asn Gin Met lie Glu Asp Arg Lys Val 
100 105 110" 

Asp Phe Asn Gin Asn Trp Tyr Phe Lys Leu Asn Ala Asn Ser Lys Glu 
115 120 125 ' 

Ala lie Lys Pro Asp Ala Asp Val Ser Thr Trp Lys Lys Leu Asp Leu 
130 : 135 140 

Pro Tyr Asp Trp Ser lie Phe Asn Asp Phe Asp His Glu Ser Pro Ala 
145 150 1^55 160 

Gin Asn Glu Gly Gly Gin Leu Asn Gly Gly Glu Ala Trp Tyr Arg Lys 
165 170 175 

Thr Phe Lys Leu Asp Glu Lys Asp Leu Lys Lys Asn Val Arg Leu Thr 
180 185 190 

Phe Asp Gly Val Tyr Met Asp Ser Gin Val Tyr Val Asn Gly Gin Leu 
195 200 205 

Val Gly His Tyr Pro Asn Gly Tyr Asn Gin Phe Ser Tyr Asp lie Thr 
210 215 220 

Lys Tyr Leu Gin Lys Asp Gly Arg Glu Asn Val lie Ala Val His Ala 
225 230 235 240 

Val Asn Lys Gin Pro Ser Ser Arg Trp Tyr Ser Gly Ser Gly lie Tyr 
245 250 255 

Arg Asp Val Thr Leu Gin Val Thr Asp Lys Val His Val Glu Lys Asn 
260 265 270 

Gly Thr Thr lie Leu Thr Pro Lys Leu Glu Glu Gin Gin His Gly Lys 
275 280 285 

Val Glu Thr His Val Thr Ser Lys lie Val Asn Thr Asp Asp Lys Asp 
290 295 . 300 

His Glu Leu Val Ala Glu Tyr Gin lie Val Glu Arg Gly Gly His Ala 
305 310 315 320 

Val Thr Gly Leu Val Arg Thr Ala Ser Arg Thr Leu Lys Ala His Glu 
325 330 ' 335 

Ser Thr Ser Leu Asp Ala lie Leu Glu Val Glu Arg Pro Lys Leu Trp 
340 345 350 

Thr Val Leu Asn Asp Lys Pro Ala Leu Tyr Glu Leu lie Thr Arg Val 
355 360 365 

Tyr Arg Asp Gly Gin Leu Val Asp Ala Lys Lys Asp Leu Phe Gly Tyr 
~ 370 375 380 

Arg Tyr Tyr His Trp Thr Pro Asn Glu Gly Phe Ser Leu Asn Gly Glu 

. 385 . 390 . 395 400 



Arg lie Lys Phe His Gly Val Ser Leu His His Asp His Gly Ala Leu 



n 
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405 410 415 

Gly Ala Glu Glu Asn Tyr Lys Ala Glu Tyr Arg Arg Leu Lys Gin Met 
420 425 430 

Lys Glu Met Gly Val Asn Ser lie Arg Thr Thr His Asn Pro Ala Ser 
435 440 445 

Glu Gin- Thr Leu Gin lie Ala Ala Glu Leu Gly Leu Leu Val Gin Glu 
450 .455 460 

Glu Ala Phe Asp Thr Trp Tyr Gly Gly Lys Lys Pro Tyr Asp Tyr Gly 
465 470 475 480 

Arg Phe Phe Glu Lys Asp Ala Thr His Pro Glu Ala Arg Lys Gly Glu 
485 490 495 

Lys Trp Ser Asp Phe Asp Leu Arg Thr Met Val Glu Arg Gly Lys Asn 
500 505 510 

~Asn Pro Ala He Phe Met Trp Ser He Gly Asn Glu He Gly Glu Ala 
515 520 525 

Asn Gly Asp Ala His Ser Leu Ala Thr Val Lys Arg Leu Val Lys Val 
530 535 540 

He Lys Asp Val Asp Lys Thr Arg Tyr Val Thr Met Gly Ala Asp Lys 
545 550 555 560 

Phe Arg Phe Gly Asn Gly Ser Gly Gly His Glu Lys He Ala Asp Glu 
565 570 575 

Leu Asp Ala Val Gly Phe Asn Tyr Ser Glu Asp Asn Tyr Lys Ala Leu 
580 585 590 

Arg Ala Lys His Pro Lys Trp Leu He Tyr Gly Ser Glu Thr Ser Ser 
595 600 605 

Ala Thr Arg Thr Arg Gly Ser Tyr Tyr Arg Pro Glu Arg Glu Leu Lys 
610 615 ' 620 

His Ser Asn Gly Pro Glu Arg Asn Tyr Glu Gin Ser Asp Tyr Gly Asn . 
625 630 635 640 

Asp Arg Val Gly Trp Gly Lys Thr Ala Thr Ala Ser Trp Thr Phe Asp 
645 650 655 

Arg Asp Asn Ala Gly Tyr Ala Gly Gin Phe He Trp Thr Gly Thr Asp 
660 665 670 

Tyr He Gly Glu Pro Thr Pro Trp His Asn Gin Asn Gin Thr Pro Val 
675 680 685 

Lys Ser Ser Tyr Phe Gly lie Val Asp Thr Ala Gly lie Pro Lys His 
690 695 700 

Asp Phe Tyr Leu Tyr Gin Ser . ^ ' . 

705 710 ' ~ 



(2). INFORMATION FOR SEQ ID-NO 83..: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2167 base pairs 



(B) 
(C) 
(D) 



TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: < 
TTACTTTGGT ATCGTAGATA CAGCCGGCAT 
CCAATGGGTT TCTGTTAAGA AGAAACC GAT 
AAACAAAGAA TTAGCATCCA AAGTAGCTGA 
TTCGAATGCT TCTAGTGTAG AATTGTTCTT 
CAATAAAAAA CAAACCAGCG ATGGGCGGAC 
TTATCTTGAA TGGAAAGTTG CCTATCAACC 
ATCTGGCAAG GAAATTGCTC GAGATAAGAT 
TCTTATTAAG GAAGACCATG CGATTGCAGC 
TGAAATTGTT GACAGCCAGG GGAATGTGGT 
ATTGCATGGC CAAGGTCAAC TGGTCGGTGT 
CTATAAGGCG CAAGCAGATG GTTCTTGGAT 
CATTGTCAAA TCAACTGAAC AAGCAGGGAA 
GAAATCGAAC CAAGTCACTG TCTTTACTGG 
GGGGACAGAA GTGCCAAAAG TACAGACCAT 
TGTTCCGTTT GTATACAGTG ATGGTAGCCG 
AGATGTGAGC AAGCCTGGTA TTGTAACGGT 
AGCTCGTGTA GAAGTGATTG CTCTTAAATC 
AAATACTGAC TTGAATTCTG TAGACAAATC 
TGAAGAGTAT GAAGTGGACA AGTGGGAGAT 
TCCAGGTTCT CGTATTCAAG CGACCGGTTA 
TGTGGTAGAA GAAGGCAATC CTGCGGCACC 
GGCAGTAACA GGTCTTACTA GTCAAAAACC 
TAAGTTGCCA GAAGTCACAG CAAGTGCTAA 
CGCAGCAAAC GGCATGCGTG CGAGCATCTT 
AACCTATGCA ATTCAATTCC TTGAAGAAGC 
GGAAAAAGCT GACAGTCTCA AAGAAGACCA 
- TCAAGATGGA ACGCAAGCTG TATTACCAGC 
AGGGGAAGTC GCAATTCGTA AAGGAATGCT 



!EQ ID NO: 83: . 
TCCAAAACAT GACTTCTATC TCTACCAAAG 
GGTACACCTT CTTCCTCACT GGAACTGGGA 
CTCAGAAGGT AAGATTC C AG ' TTCGTGCTTA 
GAATGGAAAA TCTCTTGGTC TTAAGACTTT 
TTACCAAGAA GGTGCAAATG CTAATGAACT 
AGGTAC CTTG GAAGCAATTG CTCGTGATGA 
TACGACTGCT~'GGTXAGCCAG CGGCAGTTCG 
AGATGGAAAA G AC TTGAC TT ACATCTACTA 
TCCAACTGCT AATAATCTGG TTCGCTTCCA 
AGATAACGGA GAACAAGCCA GCCGTGAACG 
TCGTAAAGCA TTTAATGGTA AAGGTGTTGC 
ATTCACCCTG ACTGCCCACT CTGATCTCTT 
TAAGAAAGAA GGACAAGAGA AGACTGTTTT 
TATTGGAGAG GC AC CTG AAA TGCCTACCAC 
TGCAGAACGT CCTGTAACCT GGTCTTCAGT 
GAAAGGTATG GCTGACGGAC GAGAAGTAGA 
AGAGCTACCA GTTGTGAAAC GTATTGCTCC 
TGTTTC CTAT GTTTTGATTG- ATGGAAGTGT 
TGCCGAAGAA GATAAAGCTA AGTTAGCAAT 
TTTAGAAGGT CAACCAATTC ATGCAACCCT 
TGCAGTACCA ACTGTAACGG TTGGTGGTGA 
AATGCAATAC CGCACTCTTG CTTATGGAGC 
AAATGCAGCT GTTACAGTTC TTCAAGCAAG 
TATTCAGCCT AAAGATGGTG GCCCTCTTCA 
GCCAAAAATT GCTCACTTGA GCTTGCAAGT 
AACTGTCAAA TTGTCGGTTC GAGCTCACTA 
-TGATAAAGTA ACCTTCTCTA CAAGTGGTGA 
TGAGTTGCAT AAGCCAGGAG CAGTCACTCT 



3,5- 
3"; - 
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GAACGCTGAA TATGAGGGAG CTAAAGACCA AGTTGAACTC ACTATCCAAG CCAATACTGA 174 0 

GAAGAAGATT GCGCAATCCA TCCGTCCTGT AAATGTAGTG ACAGATTTGC ATCAGGAACC 1800 

AAGTCTTCCA GCAACAGTAA CAGTTGAGTA TGACAAAGGT TTCCCTAAAA CTCATAAAGT I8 60" 

CACTTGGCAA GCTATTCCGA AAGAAAAACT AGACTCCTAT CAAACATTTG AAGTACTAGG 192 0 

TAAAGTTGAA GGAATTGACC TTGAAGCGCG TGCAAAAGTC TCTGTAGAAG GTATCGTTTC 1980 

AGTTGAAGAA GTCAGTGTGA CAACTCCAAT CGCAGAAGCA CCACAATTAC CAGAAAGTGT 2040 

TCGGACATAT GATTCAAATG GTC AC GTTTC ATCAGCTAAG GTTGCATGGG ATGCGATTCG 2100 

TCCAGAGCAA TACGCTAAGG AAGGTGTCTT TACAGTTAAT GGTC GCTTAG AAGGTACGCA 2160 
ATTAACA - .2167 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 amino acids 

(B) TYPE: amino- acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 

Tyr Phe Gly lie Val Asp Thr Ala Gly lie Pro Lys ■ His Asp Phe' Tyr 
1 5 ■ 10 15 

Leu Tyr Gin Ser Gin Trp Val Ser Val Lys' Lys Lys Pro Met Val His 
20 25 30 

Leu Leu Pro His Trp Asn Trp Glu Asn Lys Glu Leu Ala Ser Lys Val 
35 .40 45 

Ala Asp Ser Glu Gly Lys lie Pro Val Arg Ala Tyr Ser Asn Ala Ser 

.50 .55-60 

Ser Val Glu Leu Phe Leu Asn Gly Lys Ser Leu Gly Leu Lys Thr Phe 
65 70 75 80 

Asn Lys Lys Gin Thr Ser Asp Gly Arg Thr Tyr Gin Glu Gly Ala Asn 
85 90 95 

Ala Asn Glu Leu Tyr Leu Glu Trp Lys Val Ala Tyr Gin Pro Gly Thr 
100 105 110 

Leu Glu Ala lie Ala Arg Asp Glu Ser Gly Lys Glu lie Ala Arg Asp 
115 120 125 

Lys lie Thr Thr Ala Gly Lys Pro Ala "Ala Val Arg Leu lie Lys Glu 
130 1 " 135 - 140 

Asp His Ala li e Ala A la Asp Gly Lys Asp Leu Thr Tyr He Tyr "Tyr 
. 145 " : 150 ' ~" . . 155 160 

Glu He Val Asp Ser Gin Gly Asn Val Val Pro Thr Ala Asn Asn Leu 
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165 170 175 

Val Arg Phe Gin Leu His Gly Gin Gly Gin Leu Val Gly Val Asp Asn 
180 185 -. 190 

Gly Glu Gin Ala Ser Arg Glu Arg Tyr Lys Ala Gin Ala Asp Gly Ser 
195 200 205 

Trp lie Arg Lys Ala Phe Asn Gly Lys Gly Val Ala lie Val Lys Ser 
210 215 220 

Thr Glu Gin Ala Gly Lys Phe Thr Leu Thr Ala His Ser Asp Leu Leu 
225 • 230^ 235 240 

Lys Ser Asn Gin Val Thr Val Phe Thr Gly Lys Lys Glu Gly Gin Glu 
245 250 255 

Lys, Thr Val Leu Gly Thr Glu Val Pro Lys Val Gin Thr lie lie Gly 
260 - 265 270 

"Glu" Ala Pro Glu Met Pro Thr Thr Val Pro Phe Val Tyr Ser Asp Gly 
275 280 285 

Ser Arg Ala Glu Arg -Pro Val Thr Trp Ser Ser Val Asp Val Ser Lys 
290 295 300 

Pro Gly lie Val Thr Val Lys Gly Met Ala Asp Gly Arg Glu Val Glu 
305 310 315 320 

Ala Arg Val Glu Val He Ala Leu Lys Ser Glu Leu Pro Val Val Lys 
325 330 335 

Arg He Ala Pro Asn Thr Asp Leu Asn Ser Val Asp Lys Ser Val Ser 
340 345 350 

Tyr Val Leu He Asp Gly Ser Val Glu Glu Tyr Glu Val Asp Lys Trp 
355 360 365 

Glu He Ala Glu Glu Asp Lys Ala Lys Leu Ala He Pro Gly Ser Arg 
370 375 380 

He Gin Ala Thr Gly Tyr Leu Glu Gly Gin Pro He His Ala Thr Leu- 
385 390 395 400 

Val Val Glu Glu Gly Asn Pro Ala Ala Pro Ala Val Pro Thr Val Thr 
405 410 415 

Val Gly Gly Glu Ala Val Thr Gly Leu Thr Ser Gin Lys Pro Met Gin 
420 425 430 

Tyr Arg Thr Leu Ala Tyr Gly. Ala Lys Leu Pro Glu Val Thr Ala Ser 
435 440 445 

Ala Lys Asn Ala Ala Val Thr Val Leu Gin Ala Ser Ala Ala Asn Gly 
450 455 460' 

Met Arg Ala Ser He Phe He Gin Pro Lys Asp Gly Gly Pro Leu Gin 
465 _ 470 475 "~ 480 

Thr Tyr Ala He Gin Phe Leu Glu_G.lu...Ala_Ero_Lys He Ala His Leu 
485 490 .. . 495 . . 

Ser Leu Gin Val Glu Lys Ala Asp' Ser Leu Lys Glu Asp Gin Thr -Val 
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500 505 510 

Lys Leu Ser Val Arg Ala His Tyr Gin Asp Gly Thr Gin Ala Val Leu 

515 - - 520 525 

Pro Ala Asp Lys Val Thr Phe Ser Thr Ser Gly Glu Gly Glu Val Ala 

530 535 ' - 540 

lie Arg Lys Gly Met Leu Glu Leu His Lys Pro Gly Ala Val Thr Leu 

545 550 . 555 560 

Asn Ala Glu Tyr Glu Gly, Ala Lys Asp Gin Val Glu Leu Thr lie Gin 
565 570 575 

Ala Asn Thr Glu Lys Lys lie Ala Gin Ser lie Arg Pro Val Asn Val 
580 585 590 

Val Thr Asp Leu His Gin Glu Pro Ser Leu Pro Ala Thr Val Thr Val 
.595 600 605 

Glu' Tyr 7 Asp Ly's~GTy Phe "Pro Lys Thr 'His Lys Val Thr Trp Gin Ala 
610 615 620 

lie Pro Lys Glu Lys Leu Asp Ser Tyr Gin Thr Phe Glu Val Leu Gly 
625 630 635 640 

Lys Val Glu Gly lie Asp Leu Glu Ala Arg Ala Lys Val Ser Val Glu 
645 650 655 

Gly lie Val Ser Val Glu Glu Val Ser Val Thr Thr Pro lie Ala Glu 
660 665 670 

Ala Pro Gin Leu Pro Glu Ser Val Arg Thr Tyr Asp Ser Asn Gly His 
675 . 680 685 

Val Ser Ser Ala Lys Val Ala Trp Asp Ala lie Arg Pro Glu Gin Tyr 
690 695 700 

Ala Lys Glu Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gly Thr Gin 
705 710 715 720 

Leu Thr 



(2) INFORMATION FOR SEQ ID NO : 85: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
AGCTAAGGTT GCATGGGATG CGATTCGTCC AGAGCAATAC GCTAAGGAAG GTGTCTTTAC 60 
- AGTTAATGGT CGCTTAGAAG GTACGCAATT AACAACTAAA CTTCATGTTC GCGTATCTGC 120 



TCAAACTGAG CAAGGTGCAA ACATTTCTGA CCAATGGACC GGTTCAGAAT TGCCACTTGC 18 0 

CTTTGCTTCA GACTCAAATC CAAGCGACCC AGTTTCAAAT GTTAATGACA AGCTCATTTC 240 



203 



CTACAATAAC CAACCAGCCA ATCGTTGGAC AAACTGGAAT CGTACTAATC CAGAAGCTTC 3 00 

■■- AGTCGGTGTT C TGTTTGG AG ATTCAGGTAT CTTGAGCAAA- CGCTCCGTTG ATAATCTAAG 3 60 

TGTCGGATTC CATGAAGACC ATGGAGTTGG "TGTACCGAAG TCTTATGTGA TTGAGTATTA 42 0 

TGTTGGTAAG ACTGTCCCAA CAGCTCCTAA AAACCCTAGT TTTGTTGGTA ATGAGGACCA 48 0 

TGTCTTTAAT GATTCTGCCA ACTGGAAACC AGTTACTAAT CTAAAAGCCC CTGCTCAACT 54 0 

CAAGGCTGGA GAAATGAACC ACTTTAGCTT TGATAAAGTT GAAACCTATG CTGTTCGTAT 600 

TCGCATGGTT AAAGCAGATA ACAAGCGTGG AACGTCTATC ACAGAGGTAC AAATCTTTGC 660 

GAAACAAGTT GCGGCAGCCA AGCAAGGACA AACAAGAATC CAAGTTGACG GCAAAGACTT 720 

AGCAAACTTC AACCCTGATT TGACAGACTA CTACCTTGAG TCTGTAGATG GAAAAGTTCC - .780 

Q GGCAGTCACA GCAAGTGTTA GCAACAAT GG TCTCGCTACC GTCGTTCCAA GCGTTCGTGA ~ 840 

C| ' AGGTGAGCCA GTTCGTGTCA TCGCGAAAGC TGAAAATGGC GACATCTTAG GAGAATACCG 900 

1% TCTGCACTTC ACTAAGGATA AGAGCTTACT TTCTCATAAA CCAGTTGCTG CGGTTAAACA 960 

^ AGCTCGCTTG CTACAAGTAG GTCAAGCACT TGAATTGCCG AC T AAGGTTC CAGTTTACTT 1020 

"~4 

H CACAGGTAAA GACGGCTACG AAACAAAAGA CCTGACAGTT GAATGGGAAG AAGTTCCAGC 1080 

Q GGAAAATCTG ACAAAAGCAG GTCAATTTAC TGTTCGAGGC CGTGTCCTTG GTAGTAACCT 1140 

TGTTGCTGAG ATC AC TGT AC GAGTGACAGA CAAACTTGGT GAGACTCTTT CAGATAACCC 1200 
TAACTATGAT GAAAACAGTA ACCAGGCCTT TGCTTCAGCA ACCAATGATA TTGACAAAAA " . 1260 

CTCTCATGAC CGCGTTGACT ATCTCAATGA CGGAGATCAT TCAGAAAATC GTCGTTGGAC 13 20 

AAACTGGTCA CCAACACCAT CTTCTAATCC AGAAGTATCA GCGGGTGTGA^ TTTTCCGTGA 1380 

AAATGGTAAG ATTGTAGAAC GGACTGTTAC ACAAGGAAAA GTTCAGTTCT TTGCAGATAG 1440 

TGGTACGGAT GC AC CATC T A AACTCGTTTT AGAACGCTAT GTCGGTCCAG AGTTTGAAGT 1500 

GCCAACCTAC TATTCAAACT ACCAAGCCTA CGACGCAGAC CATCCATTCA ACAATCCAGA 1560 

AAATTGGGAA GCTGTTCCTT ATCGTGCGGA TAAAGACATT GCAGCTGGTG ATGAAATCAA 162 0 

CGTAACATTT AAAGCTATCA AAGC C AAAGC TATGAGATGG CGTATGGAGC GTAAAGCAGA 168 0 

TAAGAGC GGT GTTGCGATGA TTGAGATGAC CTTCCTTGCA CCAAGTGAAT TGCCTCAAGA 1740 

AAGCACTCAA TCAAAGATTC TTGTAGATGG AAAAGAACTT GCTGATTTCG CTGAAAATCG 1800 

.TCAAGACTAT CAAATTACCT ATAAAGGTCA ACGGCCAAAA GTCTCAGTTG AAGAAAACAA 186 0 

TCAAGTAGCT TCAACTGTGG TAGATAGTGG AGAAGATAGC TTTC CAGTAC TTGTTCGCCT 1920 

CGTTTCAGAA AGTGGAAAAC AAGTCAAGGA ATACCGTATC__ CACTTGACTA AGGAAAAACC 1980 

AGTTTCTGAG AAGAC AGTTG . CTGCTGTACA AGAAGATCTT CCAAAAATCG AATTTGTTGA ^2.04.0 



AAAAGATTTG GCATACAAGA CAGTTGAGAA AAAAGATTCA AC AC TGTATC TAGGTGAAAC ' 210.0 
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TCGTGTAGAA CAAGAAGGAA AAGTTGGAAA AGAACGTATC TTTACAGCGA TTAATCCTGA 216Q 

TGGAAGTAAG GAAGAAAAAC TCCGTGAAGT GGTAGAAGTT CCGACAGACC GCATCGTCTT 2220 

GGTTGGAACC AAACCAGTAG CTCAAGAAGC TAAAAAACCA . CAAGTGTCAG AAAAAGCAGA 2280 

TACAAAACCA ATTGATTCAA GTGAAGCTAG TCAAACTAAT AAAGCCCAG 23 29 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 



Ala Lys Val Ala Trp Asp Ala lie Arg Pro Glu Gin Tyr Ala Lys Glu 
1 5 10 15 

Gly Val Phe Thr Val Asn Gly Arg Leu Glu Gly Thr Gin Leu Thr Thr 
20 25 30 

Lys Leu His Val Arg Val Ser Ala Gin Thr Glu Gin Gly Ala Asn lie 
35 40 45 

Ser Asp Gin Trp Thr Gly Ser Glu Leu Pro Leu Ala Phe Ala Ser Asp 
50 55 60 

Ser Asn Pro Ser Asp Pro Val Ser Asn Val Asn Asp Lys Leu lie Ser 
65 7 0 75 80 

Tyr Asn Asn Gin Pro Ala Asn Arg Trp Thr Asn Trp Asn Arg Thr Asn 
85 90 95 

Pro Glu Ala Ser Val Gly Val Leu Phe Gly Asp Ser Gly lie Leu Ser 
100 105 110 

Lys Arg Ser Val Asp Asn Leu Ser Val Gly Phe His Glu Asp His Gly 
115 120 125 

Val Gly Val Pro Lys Ser Tyr . Val He Glu Tyr Tyr Val Gly Lys Thr 
130 135 140 

Val Pro Thr Ala Pro Lys Asn Pro Ser Phe Val Gly Asn Glu Asp His 
145 150 155 160 

Val Phe Asn Asp Ser Ala Asn Trp Lys Pro Val Thr Asn Leu Lys Ala 

165- 170 — 175 

Pro Ala Gin Leu Lys Ala Gly Glu Met Asn His Phe Ser Phe Asp Lys 
180 185' 190 

Val Glu Thr Tyr Ala Val Arg He Arg Met Val Lys Ala Asp_Asn Lys 
195 200 205 7 

Arg Gly Thr Ser He Thr Glu Val Gin He Phe-Ala Lys (Sin Val Ala- 
210 215 - 220 
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Ala Ala Lys Gin Gly Gin Thr Arg lie Gin Val Asp Gly Lys Asp Leu 

225 230 - 235 240 

Ala Asn Phe Asn Pro Asp Leu Thr Asp Tyr Tyr Leu Glu Ser Val Asp 

245 250 255 

Gly Lys Val Pro' Ala Val Thr Ala Ser Val Ser Asn Asn Gly Leu Ala 

. 260 265 .270 - 

Thr Val Val Pro Ser Val Arg Glu Gly Glu Pro Val Arg Val lie Ala 

, 275 * 280 285." 

Lys Ala Glu Asn Gly Asp lie Leu Gly Glu Tyr Arg Leu His Phe Thr 
290 295 300 

Lys Asp Lys Ser Leu Leu Ser His Lys Pro Val Ala Ala Val Lys Gin 

305 310 315 320 

Ala Arg Leu Leu Gin Val Gly Gin * Ala Leu Glu Leu Pro Thr Lys Val 

325 330 335. 



: *JJ Pro Val Tyr Phe Thr Gly Lys Asp Gly Tyr Glu Thr Lys Asp Leu Thr 

%J 340 345 350 

;| s i . . 

ijji Val Glu Trp Glu Glu Val Pro Ala Glu Asn Leu Thr Lys Ala Gly Gin 

jjij 355 360 365 

Phe Thr Val Arg Gly Arg Val Leu Gly Ser Asn Leu Val Ala Glu lie 
^ 370 375 380 

v. 

O Thr Val Arg Val ' Thr Asp Lys Leu Gly Glu Thr Leu Ser' Asp Asn Pro 

385 390 395 400 

ijij Asn Tyr Asp Glu Asn Ser Asn Gin Ala Phe" Ala Ser Ala Thr Asn Asp 

405 410 415 

J~ lie Asp Lys Asn Ser His Asp Arg Val Asp Tyr Leu Asn Asp Gly Asp 

420 * 425 430 

His Ser Glu Asn Arg Arg Trp Thr Asn Trp Ser Pro Thr Pro Ser Ser 
435 440 445 

Asn Pro Glu Val Ser Ala Gly Val lie Phe Arg Glu Asn Gly Lys lie 
450 455 460 

Val Glu Arg Thr Val Thr Gin Gly Lys Val Gin Phe Phe Ala Asp Ser 
465 470' 475 480 

Gly Thr Asp Ala Pro Ser Lys Leu Val Leu Glu Arg Tyr Val Gly Pro 
485 490 495 

Glu Phe Glu Val Pro Thr Tyr Tyr Ser Asn Tyr Gin Ala Tyr Asp Ala 
500 505 510 

Asp His Pro Phe Asn Asn Pro Glu. Asn Trp Glu Ala Val Pro Tyr Arg 
515 520 525 

Ala Asp Lys Asp lie Ala Ala Gly Asp Glu lie Asn Val Thr Phe Lys 
530 535 540 

Ala lie Lys Ala Lys Ala Met Arg Trp Arg Met Glu Arg Lys Ala -Asp 

54 5 550 5-55 ... - 560 



Lys Ser Gly Val Ala Met lie Glu Met Thr Phe Leu Ala Pro Ser Glu 
565 570 - 575 



Leu Pro Gin Glu Ser Thr Gin Ser Lys lie Leu Val Asp Gly Lys Glu 
580 585 590 

Leu Ala Asp Phe Ala Glu Asn Arg Gin Asp Tyr Gin lie Thr Tyr Lys 
595 600 605 

Gly Gin Arg Pro Lys Val Ser Val Glu Glu Asn Asn Gin Val Ala Ser 
610 615 620 

Thr Val Val Asp Ser Gly Glu Asp Ser Phe Pro Val Leu Val Arg Leu 
625 630 635 640 

Val Ser Glu Ser Gly Lys Gin Val Lys Glu Tyr Arg lie His Leu Thr 
645 650 .655 - 

Lys Glu Lys Pro Val Ser Glu Lys Thr Val Ala Ala Val Gin Glu Asp 
' 660 665 670 

Leu Pro Lys lie Glu Phe Val - Glu Lys Asp Leu Ala Tyr Lys Thr Val 
675 680 685 

o 

Glu Lys Lys Asp Ser Thr Leu Tyr Leu Gly Glu Thr Arg Val Glu Gin 
690 695 700 

Glu Gly Lys Val Gly Lys Glu Arg lie Phe Thr Ala lie Asn Pro Asp 
705 710 715 720 

Gly Ser Lys Glu Glu Lys Leu Arg Glu Val Val Glu Val Pro Thr Asp 
725 730 735 

Arg lie Val Leu Val Gly Thr Lys Pro Val Ala Gin Glu Ala " Lys Lys 
740 745/ 750 

Pro Gin Val Ser Glu Lys Ala Asp Thr Lys Pro lie Asp Ser Ser Glu 
755 760 765 . 

Ala Ser Gin Thr Asn Lys Ala Gin 
770 775 

(2) INFORMATION FOR SEQ ID NO: 87 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
CTATCACTAT GTAAATAAAG AGATTATTTC ACAAGAAGCT AAAGATTTAA TTCAGACAGG. 
AAAGCCTGAC . AGGAATGAAG TTGTATATGG TTTGGTGTAT CAAAAAGATC AGTTGCCTCA 
AACAGGGACA GAA ' 

(2). INFORMATION' FOR SEQ ID NO:88: 
(i) SEQUENCE CHARACTERISTICS: 



207 



(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

Tyr His Tyr Val Asn Lys Glu lie lie Ser Gin Glu Ala Lys Asp Leu 
1 5 10 15 

lie Gin Thr Gly Lys Pro Asp Arg Asn Glu Val Val Tyr Gly Leu Val 
20 25 30 - 

Tyr Gin Lys Asp Gin Leu Pro Gin Thr Gly Thr Glu 
.35 40 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doubl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
TGAGACTCCT CAATCAATAA CAAATCAGGA GCAAGCTAGG ACAGAAAACC AAGTAGTAGA 60 

GACAGAGGAA GCTCCAAAAG AAGAAGCACC TAAAACAGAA GAAAGTCCAA AGGAAGAACC 12 0 

AAAATC GG AG GTAAAACCTA CTGACGACAC CCTTCCTAAA GTAGAAGAGG GGAAAGAAGA 180 

TTCAGCAGAA CCAGCTCCAG TTGAAGAAGT AGGTGGAGAA GTTGAGTCAA AACCAGAGGA 240 

AAAAGTAGGA GTTAAGCCAG AAAGTCAACC -ATCAGACAAA CCAGCTGAGG AATCAAAAGT 3 00' 

TGAACAAGCA GGTGAACCAG TCGCGCCAAG AGAAGACGAA AAGGCACCAG TCGAGCCAGA 3 60 

AAAGCAACCA GAAGCTCCTG AAGAAGAGAA GGCTGTAGAG GAAACACCGA AACAAGAAGA 42 0 

GTCAACTCCA GATACCAAGG CTGAAGAAAC TGTAGAACCA' AAAGAGGAGA CTGTTAATCA 480 

ATC TATTGAA CAACCAAAAG TTGAAACGCC TGCTGTAGAA AAACAAACAG AACCAACAGA 540 

GGAACCAAAA GTTGAACAAG CAGGTGAACC AGTCGCGCCA AG AG AAG AC G AACAGGCACC 600 

AACGGCACCA GTTGAGCCAG AAAAGCAACC AGAAGTTCCT GAAGAAGAGA AGGCTGTAGA 660 

GGAAACACCG AAACCAGAAG ATAAAATAAA GGGTATTGGT ACTAAAGAAC CAGTTGATAA 720 

AAGTGAGTTA AATAATCAAA TTGATAAAGC TAGTTCAGTT TCTCCTACTG ATTAT 77 5 
(2) INFORMATION FOR SEQ ID NO: 90: 

— (i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 258 amino acids . 

(B) TYPE: amino acid 

(C) " STRANDEDNESS: single 
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CD) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Glu Thr Pro Gin Ser lie Thr Asn Gin Glu Gin Ala Arg Thr 
1 5 .10 

Gin Val Val Glu Thr Glu Glu Ala Pro Lys Glu Glu Ala Pro Lys Thr 
20 25 30 

Glu Glu Ser Pro Lys Glu* Glu Pro Lys Ser Glu Val Lys Pro Thr Asp 
35 .40 45 

■ Asp Thr Leu Pro Lys Val Glu Glu Gly Lys Glu Asp Ser Ala Glu Pro 
50 55 60 

Ala Pro Val Glu Glu Val Gly Gly Glu Val Glu Ser Lys Pro Glu Glu 
65 70 75 80 

Lys Val Ala Val Lys Pro Glu Ser Gin Pro Ser Asp Lys Pro Ala Glu 
85 90 95 

Glu Ser Lys Val Glu Gin Ala Gly Glu Pro Val Ala Pro Arg Glu Asp 
100 105 110 

Glu Lys Ala Pro Val Glu Pro Glu Lys Gin Pro Glu Ala Pro Glu Glu 
115 120 125 

Glu Lys Ala Val Glu Glu Thr Pro Lys Gin Glu Glu Ser Thr Pro Asp 
130 135 140 

Thr Lys Ala Glu Glu Thr Val Glu Pro Lys Glu Glu Thr Val Asn Gin 
145 .150 155 160 

Ser lie Glu Gin Pro Lys Val Glu Thr Pro Ala Val Glu Lys Gin Thr 
165 170 175 

Glu Pro Thr Glu Glu Pro Lys Val Glu Gin Ala Gly Glu Pro Val Ala 
180 185 190 

Pro Arg Glu Asp Glu Gin Ala Pro Thr Ala Pro Val Glu Pro Glu Lys 
195 200 205 

Gin Pro Glu Val Pro Glu Glu Glu Lys Ala Val Glu Glu Thr Pro Lys 
210 215 220 

Pro Glu Asp Lys. lie Lys Gly lie Gly Thr Lys Glu Pro Val Asp Lys 
225 230 235 240 

Ser Glu Leu Asn Asn Gin lie Asp Lys Ala Ser Ser Val Ser Pro Thr 
245 250 255 

Asp Tyr 



INFORMATION FOR SEQ ID NO : 91: 



Glu Asn 
15 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 199 base pairs 

(B) TYPE: nucleic acid 



.209 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

GGATGCTCAA GAAACTGCGG GAG TTC ACTA TAAATATGTG GCAGATTCAG AGCTATCATC 

AGAAGAAAAG AAGCAGCTTG TCTATGATAT " TCCGACATAC GTGGAGAATG ATGATGAAAC . 

TTATTATCTT GTTTATAAGT TAAATTCTCA AAATCAACTG GCGGAATTGC CAAATACTGG 

AAGCAAGAAT GAGAGGCAA 

(2) INFORMATION FOR SEQ ID NO : 92 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 
'(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Asp Ala Gin Glu Thr Ala Gly Val His Tyr Lys Tyr Val Ala Asp Ser 
1 5 10 15 

Glu Leu Ser Ser Glu Glu Lys Lys Gin Leu Val Tyr Asp lie Pro Thr 
20 25 30 

Tyr Val Glu Asn Asp Asp Glu Thr Tyr Tyr Leu Val Tyr Lys Leu Asn 
35 40 45 

Ser Gin Asn Gin Leu Ala Glu Leu Pro Asn Thr Gly Ser Lys Asn Glu 
50 55 60 

Arg Gin 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 base pairs 

(B) . TYPE: nucleic acid 
.■(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
CGACAAAGGT GAGACTGAGG TTCAACCAGA GTC GCCAGAT ACTGTGGTAA GTGATAAAGG 
TGAACCAGAG CAGGTAGCAC CGCTTCCAGA ATATAAGGGT AATATTGAGC AAGT AAAAC C 



TGAAACTCCG GTTGAGAAGA CCAAAGAACA— AGGTCCAGAA AAAACTGAAG AAGTTCCAGT 
AAAACCAACA GAAGAAACAC CAGTAAATCC AAATGAAGGT ACTACAGAAG GAACCTCAAT 
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TCAAGAAGCA G AAAATC C AG TTCAACCTGC AGAAGAATCA ACAACGAATT CAGAGAAAGT 
ATCACCAGAT ACATCTAGCA AAAATACTGG GGAAGTGTCC AGTAATCCTA GTGATTCGAC • 
AACCTCAGTT GGAGAATCAA ATAAACCAGA ACATAATGAC TCTAAAAATG AAAATTCAGA 
AAAAACTGTA GAAGAAGTTC CAGTAAATCC AAATGAAGGC AC AGTAGAAG GTACCTCAAA 
TCAAGAAACA GAAAAAC C AG TTCAACCTGC AGAAGAAACA CAAACAAACT CTGGGAAAAT 
AGCTAACGAA AATACTGGAG AAGTATCCAA TAAAC CTAGT GATTCAAAAC CACCAGTTGA 
AGAATCAAAT CAACCAGAAA AAAACGGAAC TGCAACAAAA CCAGAAAATT CAGGTAATAC 
AACATCAGAG AATGGACAAA CAGAACCAGA ACCATCAAAC GGAAATTCAA CTGAGGATGT 
TTCAACCGAA TCAAACACAT C C AATTC AAA TGGAAAC GAA GAAATTAAAC AAGAAAATGA 
ACTAGACCCT GATAAAAAGG TAGAAGAACC AGAGAAAACA CTTGAATTAA GAAAT 
~ {2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: ■ 

(A) LENGTH: 278 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:94:. 

Asp. Lys Gly Glu Thr Glu Val Gin Pro Glu Ser Pro Asp Thr Val Val 
1 5 10 15 

Ser Asp Lys- Gly Glu Pro Glu Gin Val Ala Pro Leu Pro Glu Tyr Lys 
20 25 30 

Gly Asn lie Glu Gin Val Lys Pro Glu Thr Pro Val Glu Lys Thr Lys 
35 40 45 

Glu Gin Gly Pro Glu Lys Thr Glu Glu Val Pro Val Lys Pro Thr Glu 
50 55 60 

Glu Thr Pro Val Asn Pro Asn Glu Gly Thr Thr Glu Gly Thr Ser lie 
65 70 75 80 

Gin Glu Ala Glu Asn Pro Val Gin Pro Ala Glu Glu Ser Thr Thr Asn 
85 90 95 

Ser Glu Lys Val Ser Pro Asp Thr Ser Ser Lys Asn Thr Gly Glu Val 
100 105 110 

Ser Ser Asn Pro Ser Asp Ser Thr Thr Ser Val Gly Glu Ser Asn Lys 
115 120 125 

"Pro Glu His Asn Asp Ser' Lys Asn Glu Asn Ser Glu Lys Thr Val Glu. 
130 * ~" 135 140 



.Glu Val Pro Val Asn Pro Asn Glu Gly Thr Val Glu Gly Thr Ser Asn 
145" 150 155 - 160 
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Gin Glu Thr Glu Lys Pro Val Gin Pro Ala Glu Glu Thr Gin Thr Asn 
165 170 175 

Ser Gly Lys lie Ala Asn Glu Asn Thr Gly Glu Val Ser Asn Lys Pro 

. 180- * 185 190 

Ser Asp Ser Lys Pro Pro Val Glu Glu Ser Asn Gin Pro Glu Lys Asn 
195 200 205 

Gly Thr Ala Thr Lys Pro Glu Asn Ser Gly Asn Thr Thr Ser Glu Asn 
210 215 220 

Gly Gin Thr Glu Pro Glu Pro Ser Asn Gly Asn Ser Thr Glu Asp Val 
225 ■ 230 235 240 

Ser Thr Glu Ser Asn Thr Ser Asn Ser Asn Gly Asn Glu Glu lie Lys 
245 250 255 

Gin Glu Asn Glu Leu Asp Pro Asp Lys Lys Val Glu Glu Pro Glu Lys 

260 - - 265 270 

Thr Leu Glu Leu Arg Asn 
275 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC ACTAAACTGA CTGCTGAAAA . ,60 

ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA GATATAACTC ATTTAACAGA 120 

TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT TCAGCATTAG ACGGAGCGAC 180 

AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC CCAGATGGTT CAGTAGTGAC 240 

GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT GAATCTGTAA CTCAAGAAGC 3 00 

TACACCAGAG T AT AAGC TAG AAAATACACC AGGTGGAGAT AAGGGAGGCA ATACTGGAAG 3 60 

CTCAGATGCT AATGC GAATG AAGGCGGTGG TAGCCAGGCG GGTGGATCAG CTCACACAGG 420 

TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA GCTACTGAAA AAGAATCAGC . 480 

TAAAAATGCC ATTGAAAAAG CAGCCAAGGA CAAGCAGGAT GAAATCAAAG GCGCACCGCT 540 

TTC TG AT AAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA GCAGAAAAAC AAGCAGCTCT 600 

CAAAGAGATT GAAAATGCGA AAACTATGGA AGATGTGAAG GAAGCAGAAA CGATTGGAGT 660 

GCAAGCCATT GCCATGGTTA CAGTTCCTAA GAGACCAGTG GCTCCTAAT 709 
(2) ■ INFORMATION FOR SEQ ID NO: 96: """""" 
<i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 6 amino acids 

(B) . TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE . TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Asn Gin Leu Val Ala Gin Asp Pro Lys Ala- Gin Asp Ser Thr Lys Leu 
1 5 10 15 

Thr Ala Glu Lys Ser Thr Val Lys Ala Pro Ala Gin Arg Val Asp Val 
20 25 30 

Lys Asp lie Thr His Leu Thr Asp Glu Glu Lys Val Lys Val Ala lie 
35 40 45 

Leu Gin Ala Asn Gly Ser Ala Leu Asp Gly Ala Thf~ lie Asn Val Ala 

50 55 ~60 - 

Gly Asp Gly Thr Ala Thr lie Thr Phe Pro Asp Gly Ser Val Val Thr 
65 70 75 80 

lie Leu Gly Lys Asp Thr Val Gin Gin Ser Ala Lys Gly Glu Ser Val 

85 90 , ' 95 

Thr Gin Glu Ala Thr Pro Glu Tyr Lys Leu Glu Asn Thr Pro Gly Gly 
100 105 110 

Asp Lys Gly Gly Asn Thr Gly Ser Ser Asp Ala Asn Ala Asn Glu Gly 
115 ' 120 125 

Gly Gly Ser Gin Ala Gly Gly Ser Ala His Thr Gly Ser Gin Asn Ser 
130 135 140 

Ala Gin Ser Gin Ala Ser Lys Gin Leu Ala Thr Glu Lys Glu Ser Ala 
145 150 ,155 160 

Lys Asn Ala lie Glu Lys Ala Ala Lys Asp Lys Gin Asp Glu ile Lys 
. 165 170 175 

Gly Ala Pro Leu Ser Asp Lys Glu Lys Ala Glu Leu Leu Ala Arg Val 
180 185 190 

Glu Ala Glu Lys Gin Ala Ala Leu Lys Glu Ile Glu Asn Ala Lys Thr 
. 195 200 ' 205 

Met Glu Asp Val Lys Glu Ala Glu Thr Ile Gly Val Gin Ala lie Ala 
210 215 220 

Met Val Thr Val Pro Lys Arg Pro Val Ala Pro Asn 
225 230 235 

INFORMATION FOR SEQ ID NO: 97 :~" 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic ■ acid 

_ (C) STRANDEDNESS: double - 
(D) TOPOLOGY: linear 



# 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CAAACAGTCA GCTTCAGGAA CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG 60 

GGGTGCCTTC ACAGAAATCA CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA ' 120 

CACTGCCAAA AC AGC TGTGA TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG 18 0 

GAATGCTAAT GCTATCGGCT ACATCTCCTT GGGATC TTTA ACGAAATCTG TCAAGGCTTT 240 

AGAGATTGAT GGTGTCAAGG- CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA 3 00 

ACGTCCCTTC AACATTGTTT GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG ' 3 60 

C TTTATC C AC TCCAAACAAG GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA .420 

AACCGAAACC ACGGAATATA CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC 480 

CACTTCAGTA TCTTCTTTAA TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATC C AG A 540 

AGTT AC G ATT GATATTACCT CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA 600 

AACCGCTGAT ATTGGTATGG TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC 660 

CCATGATGCT ATTGC TTT AG ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG 72 0 

CCAAGTCAGT ATGGC TGAAC TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA 780 

GATTAAA . 787 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

Lys Gin Ser Ala Ser Gly Thr lie Glu Val lie Ser Arg Glu Asn Gly 
1 5 10 . 15 

Ser Gly Thr Arg Gly Ala Phe Thr Glu lie Thr Gly lie Leu Lys Lys 
20 25 30 

Asp Gly Asp Lys Lys lie Asp Asn Thr Ala Lys Thr Ala Val lie Gin 

35 .40 45 

Asn Ser Thr Glu Gly Val Leu Ser Ala Val Gin Gly Asn Ala' Asn Ala 
50 55' . 60" 

lie Gly Tyr lie Ser Leu Gly Ser Leu Thr Lys Ser Val Lys Ala Leu_" 
65 . 70 75 80" 



Glu lie Asp Gly Val Lys Ala Ser Arg Asp Thr Val Leu-Asp Gly Glu . 

85 . 90 ~ 95 . 
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Tyr Pro Leu Gin Arg Pro Phe Asn lie Val Trp Ser Ser Asn Leu Ser 
100 105 110 

Lys Leu Gly Gin Asp Phe lie Ser Phe lie His Ser Lys Gin Gly Gin 
115 120 125 

Gin Val Val Thr Asp Asn Lys Phe lie Glu. Ala Lys Thr Glu Thr_Thr 
130 135 140 

Glu Tyr Thr Ser Gin His Leu Ser Gly Lys Leu Ser Val Val Gly Ser 
145 150 155 • 160 

Thr Ser Val Ser Ser Leu Met Glu Lys Leu Ala Glu Ala Tyr Lys Lys 
165 170 175 

Glu Asn Pro Glu Val Thr lie Asp lie Thr Ser Asn Gly Ser Ser Ala 
180. . - 185 190 

Gly lie Thr Ala Val Lys Glu Lys Thr Ala Asp lie Gly Met Val Ser 
195 200 205 

Arg Glu Leu Thr Pro Glu Glu Gly Lys Ser Leu Thr His Asp Ala lie 

210 215 220 ■* . 



Ala Leu Asp Gly lie Ala Val Val Val Asn Asn Asp Asn Lys Ala Ser 
225 ^ 230 235 240 

Gin Val Ser Met Ala Glu Leu Ala Asp Val Phe Ser Gly Lys Leu Thr 
245 250 255 

Thr Trp Asp Lys lie Lys 

. 260 

(2) INFORMATION FOR SEQ ID NO: 99: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

ATTCGATGAT GCGGATGAAA AGATGACCCG TGATGAAATT GCCTATATGC TGAC AAATAG 60 

TGAAGAAACA - TTGGATGCTG ATGAGATTGA GATGCTACAA GGTGTCTTTT CGCTCGATGA 120 

ACTGATGGCA C GAG AGGTTA TGGTTCCTCG AACGGATGCC TTTATGGTGG ATATTCAGGA 180 

TGATAGTCAA GCCATTATCC AAAGTATTTT AAAACAAAAT TATTCTCGTA TCCCGGTTTA 240 

TGATGGGGAT AAGGACAATG TAATTGGAAT CATTCACACC AAGAGTCTCC TTAAGGCAGG 3 00 

CTTTGTGGAC GGTTTTGACA ATATTGTTTG GAAGAGAATT TTACAAGATC CACTTTTTGT ~ 3 60 

ACCTGAAACT ATTTTTGTGG ATGACTTGCT AAAAGAACTG CGAAATACCC AAAGACAAAT 420 
G . -.\__- 421 

(2) INFORMATION FOR SEQ ID NO: 100: L 
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(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii). MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Phe Asp Asp Ala Asp Glu Lys Met Thr Arg Asp Glu lie Ala Tyr Met 
1 5 .10 15 

Leu Thr Asn Ser Glu Glu Thr Leu Asp Ala Asp Glu lie Glu Met Leu 
20 25 30 

Gin Gly Val Phe Ser Leu Asp Glu Leu Met Ala Arg Glu Val Met Val 
35 40 ■ 45 

Pro Arg Thr Asp Ala Phe Met Val Asp lie Gin Asp Asp Ser Gin Ala 
50 55 60 

lie lie Gin Ser lie Leu Lys Gin Asn Tyr Ser Arg lie Pro Val Tyr 
65 70 75 80 

Asp Gly Asp Lys Asp Asn Val lie Gly lie lie His Thr Lys Ser Leu 
85 90 95 

Leu Lys Ala Gly Phe Val Asp Gly Phe Asp Asn lie Val Trp Lys Arg 
100 105 110 

lie Leu Gin Asp Pro Leu Phe Val Pro Glu Thr lie Phe Val Asp Asp 
115 120 125 . 

Leu Leu Lys Glu Leu Arg Asn Thr Gin Arg Gin Met 
130 135 . 140 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi)" SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

GGAGAGTCGA TCAAAAGTAG ATGAAGCTGT GTCTAAGTTT GAAAAGGACT CATCTTCTTC 60 

. GTC AAGTTC A GACTCTTCC A CTAAACCGGA AGCTTCAGAT ACAGCGAAGC CAAACAAGCC 120 

GACAGAACCA GGAGAAAAGG TAGCAGAAGC TAAGAAGAAG GTTGAAGAAG CTGAGAAAAA 180 

AGCCAAGGAT CAAAAAGAAG AAGATCGTCG TAACTACCCA ACCATTACTT ACAAAACGCT 240 

TGAACTTGAA ATTGCTGAGT CCGATGTGGA AGTTAAAAAA GCGGAGCTTG AACTAGTAAA 3 00 

AGTGAAAGCT AACGAACCTC ... GAGACGAGCA A 331 
(2) INFORMATION FOR SEQ ID NO: 102 
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' (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 102 : 

Glu Ser Arg Ser Lys Val Asp Glu Ala Val Ser Lys Phe Glu Lys Asp 
1 5 . , 10 15 

Ser Ser Ser Ser Ser Ser Ser Asp Ser Ser Thr Lys Pro Glu Ala Ser 
20 25 30 

Asp Thr Ala Lys Pro Asn Lys Pro "Thr Glu Pro Gly Glu Lys Val Ala 
35 40 45 

Glu Ala Lys Lys Lys Val Glu Glu Ala Glu Lys Lys Ala Lys Asp Gin 

50 55- "■' 60 

Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr lie Thr Tyr Lys Thr Leu 
65 70 75 80 

Glu Leu Glu lie Ala Glu Ser Asp Val Glu Val Lys Lys Ala Glu Leu 
85 90 95 

Glu Leu Val Lys Val Lys Ala Asn Glu Pro Arg Asp Glu Gin 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 103: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
ATGGACAACA GGAAACTGGG AC G AGGTT AT ATCTGGTAAG ATTGACAAGT ACAAAGATCC - - 60 

AGATATTCCA ACAGTTGAAT CACAAGAAGT TACGTCAGAC TCTAGTGATA AAGAAATAAC 120 

GGTAAGGTAT GACCGTTTAT CAACACCAGA AAAACCAATC CCACAACCAA. ATCCAGAGCA 180 

TCCAAGTGTT CCGACACCAA ACCCAGAACT ACCAAATCAA. GAGACTCCAA CACCAGATAA 240 

ACCAACTCCA GAACCAGGTA CTCCAAAAAC TGAAACTCCA GTGAATCCAG ACCCAGAAGT 3 00 

TCCGACTTAT GAGACAGGTA AGAGAGAGGA ATTGC C AAAC ACAGGTACAG AAGCTAAT 3 58 
(2) INFORMATION FOR SEQ ID NO: 104: • 

^ CONSEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

Trp Thr Thr Gly Asn Trp Asp Glu Val lie Ser Gly Lys lie Asp Lys 
1 5 10 15 

Tyr Lys Asp Pro Asp lie Pro Thr Val Glu Ser Gin Glu Val Thr Ser 
'20 25 30 

Asp Ser Ser Asp Lys Glu lie Thr Val Arg Tyr Asp Arg Leu Ser Thr 
35 .40 45 

Pro Glu Lys Pro lie Pro Gin Pro Asn Pro Glu His Pro Ser Val Pro 
50 55 60 

Thr Pro Asn Pro Glu Leu Pro Asn Gin Glu Thr Pro Thr Pro Asp Lys 
65 70 75 80 

Pro Thr Pro Glu Pro Gly Thr Pro Lys Thr Glu Thr Pro Val Asn Pro 
'85 90 95 

Asp Pro Glu Val Pro Thr Tyr Glu Thr Gly Lys Arg Glu Glu Leu Pro 
100 105 110 

- Asn Thr Gly Thr Glu Ala. Asn 
115 

(2) . INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : -linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CGATGGGCTC AATCCAACCC CAGGTCAAGT CTTACCTGAA GAGACATCGG GAACGAAAGA 60 

GGGTGACTTA TCAGAAAAAC CAGGAGACAC CGTTCTCACT CAAGCGAAAC CTGAGGGCGT . 12 0 

TACTGGAAAT ACGAATTCAC TTCCGACACC TACAGAAAGA ACTGAAGTGA GCGAGGAAAC 180 

AAGCCCTTCT AGTCTGGATA CACTTTTTGA AAAAGATGAA GAAGCTCAAA AAAATCCAGA 240 

GCTAACAGAT GTCTTAAAAG AAACTGTAGA TACAGCTGAT GTGGATGGGA CACAAGCAAG 300 

TCCAGCAGAA ACTACTCCTG AACAAGTAAA AGGTGGAGTG AAAGAAAATA CAAAAGACAG 3 60 

CATC GATGTT CCTGCTGCTT ATCTTGAAAA AGCTGAAGGG AAAGGTCCTT TCACTGCCGG 42 0 

TGTAAACCAA GTAATTCCTT ATGAACTATT CGCTGGTGAT GGTATGTTAA CTCGTCTATT 48 0 

ACTAAAAGCT -TCGGATAATG— GTCGTTGGTC TGACAATGGT ACTGCTAAAA ATCCTGCTTT 540 

ACCTCCTCTT GAAGGATTAA CAAAAGGGAA ATACTTCTAT . GAAGTAGACT TAAATGGCAA 600 
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TACTGTTGGT AAACAAGGTC AAGCTTTAAT TGATCAACTT CGCGCTAATG GTACTCAAAC 660 

TTATAAAGCT ACTGTTAAAG TTTACGGAAA TAAAGACGGT AAAGCTGACT TGACTAATCT 720 

AGTTGCTACT AAAAATGTAG ACATCAACAT CAATGGATTA GTTGCTAAAG AAACAGTTCA 78 0 

AAAAGCCGTT GCAGACAACG TTAAAGACAG TATCGATGTT CCAGCAGCCT ACCTAGAAAA 840 

AGCCAAGGGT GAAGGTCCAT TCACAGCAGG TGTCAACCAT GTGATTCCAT ACGAACTCTT 900 

CGCAGGTGAT GGCATGTTGA CTCGTCTCTT GCTCAAGGCA TCTGACAAGG CACCATGGTC 960 

AGATAACGGG GACGCTAAAA ACCCAGCCCT ATCTCCACTA GGCGAAAACG TGAAGACCAA 1020 

AGGTCAATAC TTCTATCAAN ■ TAGCCTTGGA CGGAAATGTA GCTGGCAAAG AAAAACAAGC 1080 

GCTCATTGAC CAGTTCCGAG CAAANGGTAC TCAAACTTAC AGCGCTACAG TCAATGTCTA , 1140 

TGGTAACAAA GACGGTAAAC CAGACTTGGA CAACATCGTA GCAACTAAAA AAGTC AC TAT 1200 

"TAACATAAAC GGTTTAATTT CTAAAGAAAC AGTTCAAAAA GCCGTTGCAG ACAACGTTAA 1260 

NGACAGTATC GATGTTCCAG CAGCCTACCT AGAAAAAGCC AAGGGTGAAG GTCCATTCAC 1320 

AGCAGGTGTC AACCATGTGA TTCCATACGA ACTCTTCGCA GGTGATGGTA TGTTGACTCG 1380 

TCTCTTGCTC AAGGCATCTG AC AAGGC AC C ATGGTCAGAT AACGGNGACG CTAAAAACCC 1440 

AGCNCTATCT CCACTAGGTG AAAACGTGAA GACCAAAGGT CAATACTTCT ATCAANTAGC 1500 

C TTGGAC GGA AATGTAGCTG GCAAAGAAAA AC AAGC GCTC ATTGACCAGT TCCGAGCAAA 1560 

CGGTACTCAA ACTTACAGCG CTACAGTCAA TGTCTATGGT AACAAAGACG GTAAACCAGA 162 0 

CTTGGACAAC ATCGTAGCAA CTAAAAAAGT CACTATTAAG ATAAATGTTA AAGAAACATC . 1680 

AGACACAGCA AATGGTTCAT TATCACCTTC TAACTCTGGT TCTGGCGTGA CTCCGATGAA _ 1740 

TCACAATCAT GCTACAGGTA CTACAGATAG CATGCCTGCT GACACCATGA CAAGTTCTAC 1800 

CAACACGATG GC AGG TG AAA ACATGGCTGC TTCTGC TAAC AAGATGTCTG ATACGATGAT .1860 

GTCAGAGGAT AAAGCTATG 1879 
(2) INFORMATION FOR SEQ ID NO: 106: 

, (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 6 amino acids 

(B) ' TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE' TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: - ~- 

Asp Gly Leu Asn Pro Thr Pro Gly Gin Val Leu" Pro Glu Glu Thr Ser 
1 5 10 15 



Gly Thr Lys. Glu Gly Asp Leu Ser Glu Lys Pro Gly Asp Thr Val Leu 
20 25 30 



Thr Gin Ala Lys Pro Glu Gly Val Thr Gly Asn Thr Asn Ser Leu Pro 
35 - 40 45 

Thr Pro Thr Glu -Arg Thr Glu Val Ser Glu Glu Thr Ser Pro Ser Ser 
50— 5 5 60 

Leu Asp Thr* Leu Phe Glu Lys Asp Glu Glu Ala Gin Lys Asn Pro Glu 
65 70 . 75 80 

Leu Thr Asp Val Leu Lys Glu Thr Val Asp Thr Ala Asp Val Asp Gly 
85 90 95 

Thr Gin Ala Ser Pro Ala Glu Thr Thr Pro Glu Gin Val Lys Gly Gly 
100 105 - 110' 

Val Lys Glu Asn Thr Lys Asp Ser lie Asp Val Pro Ala Ala Tyr Leu 
.115 120 - 125 

Glu Lys Ala Glu Gly Lys Gly Pro Phe Thr. Ala Gly Val Asn Gin Val 
130 135 140 

lie Pro Tyr Glu Leu Phe Ala Gly Asp Gly Met Leu Thr Arg Leu Leu 
145 150 155 160 

Leu Lys Aia Ser Asp Asn Ala Pro Trp Ser Asp Asn Gly Thr Ala Lys 
165 170 175 

Asn Pro Ala Leu Pro Pro Leu Glu Gly Leu Thr Lys Gly Lys Tyr Phe 
180. 185 190 

Tyr Glu Val Asp Leu Asn Gly Asn Thr Val Gly Lys Gin Gly Gin Ala 
195^ 200 205 

Leu lie Asp Gin Leu Arg Ala Asn Gly Thr Gin Thr Tyr Lys Ala Thr 
210 215 220 

Val Lys Val Tyr Gly Asn Lys Asp Gly Lys Ala Asp Leu Thr Asn Leu 
225^ 230 235 240 

Val Ala Thr Lys Asn Val Asp lie Asn lie Asn Gly Leu Val Ala Lys 
245 ^ 250 255 

Glu Thr Val Gin Lys Ala Val Ala Asp Asn Val Lys Asp Ser lie Asp 
260 * 265 ' 270 

Val Pro Ala Ala Tyr Leu Glu Lys Ala Lys Gly Glu Gly Pro Phe Thr 
275 . 280 285 

Ala Gly Val Ash His Val lie Pro Tyr Glu Leu Phe Ala Gly Asp Gly 
290 295 300 

Met Leu Thr Arg Leu Leu Leu Lys Ala Ser Asp Lys Ala Pro Trp Ser 
305 — 310 315 320 

_Asp Asn Gly Asp Ala Lys Asn Pro Ala Leu Ser Pro Leu Gly Glu Asn 
325 330 - 335 

\ ■ i 

Val Lys Thr Lys Gly Gin Tyr' Phe Tyr Gin Xaa Ala Leu' Asp Gly Asn 
340*" 345 350 



-Va-l-Ala Gly Lys Glu Lys -Gin Ala Leu He Asp Gin Phe Arg Ala Xaa 
355 360 365 
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Gly Thr Gin Thr Tyr Ser Ala Thr Val Asn Val Tyr Gly Asn Lys Asp 

370 375 380 

Gly Lys Pro Asp Leu Asp Asn. lie Val Ala Thr Lys Lys Val Thr lie 
385 . 390_ - 395 400' 

Asn lie Asn Gly Leu lie Ser Lys Glu Thr Val Gin. Lys Ala Val Ala 
405 410 415 

Asp" Asn Val Xaa Asp Ser lie Asp Val Pro Ala Ala Tyr Leu Glu Lys 
420 425 430 

Ala Lys Gly Glu Gly Pro Phe Thr Ala Gly Val Asn His Val lie Pro 
' 435 440 445 

Tyr Glu Leu Phe Ala Gly Asp Gly Met Leu Thr. Arg Leu Leu Leu Lys 

450 455 460 

Ala Ser Asp Lys Ala Pro Trp Ser Asp Asn Gly Asp Ala Lys Asn Pro 
465 ' 470 475 ' 480 



Ala Leu Ser Pro Leu Gly Glu Asn Val Lys Thr Lys Gly Gin Tyr Phe 
485 490 495 

Tyr Gin Xaa Ala Leu Asp Gly Asn Val Ala. Gly Lys Glu Lys Gin Ala 
500 505 510 

Leu lie Asp Gin Phe Arg Ala Asn Gly Thr Gin Thr Tyr Ser Ala Thr 
515 520 525 

Val Asn Val Tyr Gly Asn Lys Asp Gly Lys Pro Asp Leu Asp Asn lie 
530 535 540 

Val Ala Thr Lys Lys Val Thr lie Lys lie Asn Val Lys Glu Thr Ser 
545 550 555 560 

Asp. Thr Ala Asn Gly Ser Leu Ser Pro Ser Asn Ser Gly Ser Gly Val 
565 570 .~ 575 

Thr Pro Met Asn His Asn His Ala Thr Gly Thr Thr Asp Ser Met Pro 
580 585 590 

Ala Asp Thr Met Thr Ser Ser Thr Asn Thr Met Ala Gly Glu Asn Met 
595 600 605 

Ala -Ala Ser Ala Asn Lys Met Ser Asp Thr Met Met Ser Glu Asp Lys 
610 615 620 

Ala Met 
625 

(2) INFORMATION FOR SEQ ID NO: 107: 

(1) SEQUENCE CHARACTERISTICS: 

(A) " LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



-(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
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• TTCCAATCAA AAAC AGGC AG ATGGTAAACT CAATATCGTG ACAACCTTTT ACCCTGTCTA 6Q 

TGArTTTACC AAGCAAGTCG CAGGAGATAC GGCTAATGTA GAACTCCTAA TCGGTGCTGG. 120 

GACAGAACCT CATGAATACG AACCATCTGC CAAGGCAGTT GGCAAAATCC AAGATGCAGA 18 0 

TACCTTCGTT TATGAAAATG AAAACATGGA AACATGGGTA CCTAAATTGC TAG AT AC C TT 240 

GGATAAGAAA AAAGTGAAAA CCATCAAGGC G AC AGGC GAT ATGTTGCTCT TGCCAGGTGG 300 

CGAGGAAGAA GAGGGAGACC ATGACCATGG AGAAGAAGGT CATCACCATG AGTTTGACCC 3 60 

CCATGTTTGG TTATCACCAG TTC GTGC CAT tAAACTAGTA GAGCACCATC CGCGACACTT 420 

GTCAGCAGAT TATCCTGATA AAAAAGAGAC CTTTGAGAAG AATGCAGCTG CCTATATCGA 480 
AAAATTGCAA GCCTTGGATA AGGCTTACGC AGAAGGTTTG TCTCAAGCAA AACAAAAGAG . 540 

CTTTGTGACT CAACACGCAg CCTTTAACTa TCTTGCCTTG GACTATGGGA CTC 593 
(2) INFORMATION FOR SEQ ID. NO: 108: " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: 

Ser Asn Gin Lys Gin Ala Asp Gly Lys Leu Asn lie Val Thr Thr Phe 
1 5 10 15 

Tyr Pro Val Tyr Glu Phe Thr Lys Gin Val Ala Gly Asp Thr Ala Asn 
20 25 .30 

Val Glu Leu Leu lie Gly Ala Gly Thr Glu Pro His Glu Tyr Glu Pro 
35 40 45 

Ser Ala Lys Ala Val Ala Lys lie Gin Asp Ala Asp Thr Phe Val Tyr 

50 55 60 : 

Glu Asn Glu Asn Met Glu Thr Trp Val Pro Lys Leu Leu Asp Thr Leu 
65 70 75 80 

Asp Lys Lys Lys Val Lys Thr lie Lys Ala Thr Gly Asp Met Leu Leu 
85 90 95 

Leu Pro Gly Gly Glu Glu Glu Glu Gly Asp His -Asp His Gly Glu Glu 
100 105 " " 110 

Gly His His His Glu Phe Asp Pro His Val Trp Leu Ser, Pro Val Arg 
115 120 ~ 125 

Ala lie Lys Leu Val Glu His His Pro Arg His Leu Ser__Ala Asp -Tyr 
130 135 140 „ 

Pro Asp Lys Lys Glu Thr Phe Glu Lys Asn- Ala Ala Ala Tyr lie Glu 
145 — 150 ■ 155 '160 -- 
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Lys Leu Gin Ala Leu Asp Lys Ala Tyr Ala Glu Gly Leu Ser Gin Ala 
165 170 175 

Lys Gin Lys Ser Phe Val Thr Gin His Ala Ala Phe Asn Tyr Leu Ala 
— 180 185 190 

Leu _Asp Tyr Gly Thr 

195 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : double 

(D) TOPOLOGY: .linear 



Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: ■_ 

Vj TATCACAGGA TCGAACGGTA AGACAACCAC AAC GACTATG ATTGGGGAAG TTTTGACTGC 60 

jJ>J TGCTGGCCAA CATGGTCTTT TATCAGGGAA TATCGGCTAT CCAGCTAGTC AGGTTGCTCA 12 0 

AATAGCATCA GATAAGGACA CGCTTGTTAT GGAACTTTCT TCTTTCCAAC TCATGGGTGT 18 0 

TCAAGAATTC CATCCAGAGA TTGCGGTTAT TACCAACCTC ATGCCAACTC ATATCGACTA 240 

Q CCATGGGTCA TTTTC GGAAT ATGTAGCAGC CAAGTGGAAT ATCCAGAACA AGATGACAGC 3 00 

111 AGCTGATTTC CTTGTCTTGA ACTTTAATCA AGAC TTGGC A AAAGACTTGA CTTCCAAGAC 3 60 

AGAAGC C ACT GTTGT AC CAT TTTCAACACT TGAAAAGGTT GATGGAGCTT ATCTGGAAGA 42 0 

H TGGTCAACTC TACTTCCGTG GTGAAGTAGT CATGGCAGCG AATGAAATCG GTGTTCCAGG 480 

TAGCCACAAT GTGGAAAATG CCCTTGCGAC TATTGCTGTA GCCAAGCTTC GTGATGTGGA 540 

CAATCAAACC ATCAAGGAAA CTCTTTCAGC CTTCGGTGGT GTCAAACACC GTCTCC AGTT 600 

TGTGGATGAC ATCAAGGGTG TTAAATTCTA TAACGACAGT AAATCAACTA ATATCTTGGC 660 

TACTCAAAAA GCCTTGTCAG GATTTGACAA CAGCAAGGTC GTCTTGATTG CAGGTGGTTT 72 0 

GGACCGTGGC AATGAGTTTG ACGAATTGGT GCCAGACATT ACTGGACTCA AGAAGATGGT 78 0 

CATCCTGGGT CAATCTGCAG AACGTGTCAA ACGGGCAGCA GAC AAGGC TG GTGTC GCTTA 840 

TGTGGAGGCG ACAGATATTG CAGATGCGAC CCGCAAGGCC TATGAGCTTG CGACTCAAGG '900 

AGATGTGGTT CTTCTTAGTC CTGCCAATGC TAGCTGGGAT ATGTATGCTA ACTTTGAAGT 960 

AC GTGGCGAC CTCTTTATCG ACACAGTAGC GGAGTTAAAA GAA , 1003 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

__. (C) STRANDEDNESS: ' single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Gly lie Thr Gly Ser Asn Gly Lys Thr Thr Thr Thr Thr Met lie Gly 
1 5 10 15 

Glu Val Leu Thr Ala Ala Gly Gin His Gly Leu Leu Ser Gly Asn lie 
20 25 30 

Gly Tyr Pro Ala Ser Gin Val Ala Gin lie Ala Ser Asp Lys Asp Thr 
35 .40 45 

Leu Val Met Glu Leu Ser Ser Phe Gin Leu' Met Gly Val Gin Glu Phe 
50 55 60 

His Pro Glu lie Ala Val lie Thr Asn Leu Met Pro Thr His lie Asp 
65 70- " 75 80 

Tyr His Gly Ser Phe Ser Glu Tyr Val Ala Ala Lys Trp Asn lie Gin 
85 90 95 

Asn Lys Met Thr Ala Ala Asp Phe Leu Val Leu Asn Phe Asn Gin Asp 
100 105 110 

Leu Ala Lys Asp Leu Thr Ser Lys Thr Glu Ala Thr Val Val Pro Phe 
115 120 125 



Ser Thr Leu Glu Lys Val Asp Gly Ala Tyr Leu Glu Asp Gly Gin Leu 
130 135 140 

Tyr Phe Arg Gly Glu Val Val Met Ala Ala Asn Glu lie Gly Val Pro 
145 150 155 ~ 160 

Gly Ser His Asn Val Glu Asn Ala Leu Ala Thr lie Ala Val Ala Lys 
165 170 . 175 

Leu Arg Asp Val Asp Asn" Gin Thr lie Lys Glu Thr Leu Ser Ala Phe 
180 185 190 

Gly Gly Val Lys His Arg Leu Gin Phe Val Asp Asp lie Lys Gly Val 
195 200 205 

Lys Phe Tyr Asn Asp Ser Lys Ser Thr Asn lie Leu Ala Thr Gin Lys 
210 215 220 

Ala Leu Ser Gly Phe Asp Asn Ser Lys Val Val Leu lie Ala Gly Gly 
225 230 235 240 

Leu Asp Arg Gly Asn Glu Phe Asp Glu Leu Val Pro Asp He Thr Gly 
245 250 255 

Leu Lys Lys Met Val He Leu Gly Gin Ser Ala Glu Arg Val Lys Arg 
260 265 270 

Ala Ala Asp Lys Ala Gly Val Ala Tyr Val Glu Ala Thr Asp lie Ala 
275 280 285 

Asp Ala Thr Arg Lys Ala Tyr Glu Leu Ala Thr Gin Gly Asp Val Val 

290 ... \ - 295 300 

Leu Leu Ser Pro Ala ' Asn Ala Ser Trp Asp Met Tyr Ala Asn Phe Glu 



305 



310 



315 



320 



Val Arg Gly Asp Leu Phe lie Asp Thr Val Ala Glu Leu Lys Glu 
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
AAGTTCATCG AAGATGGTTG GGAAGTCCAC TATATCGGGG ACAAGTGTGG TATCGAACAC 
CAAGAAATCC TTAAGTCAGG TTTGGATGTC ACCTTCCATT CTATTGCGAC TGGAAAATTG 
CGTCGCTATT TCTCTTGGCA AAATATGCTG GACGTCTTCA AAGTTGGTTG GGGAATTGTC 
• CAATCGCTCT TTATCATGTT GCGACTGCGT CCACAGACCC TTTTTTCAAA GGGGGGCTTT 
GTCTCAGTAC CGCCTGTTAT CGCTGCGCGT GTGTCAGGAG TGCCTGTCTT TATTCACGAA 
TCTGACCTGT C TATGGGCTT GGCCAATAAA ATCGCCTATA AATTTGCGAC TAAGATGTAT 
TCAACCTTTG AACAAGCTTC GAGTTTGGCT AAGGTTGAGC ATGTGGGAGC GG 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
-(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Ser Ser Ser Lys Met Val Gly Lys Ser Thr lie 
1 5 • 10 

Val Ser Asn Thr Lys Lys Ser Leu Ser Gin Val Trp Met Ser Pro Ser 
20 25 30 

lie Leu Leu Arg Leu Glu Asn Cys Val Ala lie Ser Leu Gly Lys lie 
35 40 45 

Cys Trp Thr Ser Ser Lys Leu Val Gly Glu Leu Ser Asn Arg Ser Leu 
50 55 '60 

Ser Cys Cys Asp Cys Val His Arg Pro Phe Phe Gin Arg Gly Ala Leu 
65 70 75 80 

Ser Gin Tyr Arg Leu Leu Ser Leu Arg Val Cys Gin Glu Cys Leu Ser 

85 — 90 95 

Leu Phe Thr Asn Leu Thr Cys Leu Trp Ala Trp Pro lie Lys Ser Pro 



Ser Gly Thr Ser Val 

■ 15 
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100 1 105 . 110 

lie Asn Leu Arg Leu Arg Cys lie Gin Pro Leu Asn Lys Leu Arg Val 
115 120 125 

Trp Leu Arg Leu Ser Met Trp Glu Arg 
130 ^ 135 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 
— - (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

ATCGCTAGCT AGTGAAATGC AAGAAAGTAC ACGTAAATTC AAGGTTACTG CTGACCTAAC 60 

AGATGCCGGT GTTGGAACGA TTGAAGTTCC TTTGAGCATT GAAGATTTAC CCAATGGGCT 12 0 

GACCGCTGTG GCGACTCCGC AAAAAATTAC AGTCAAGATT GGTAAGAAGG CTCAGAAGGA 18 0 

TAAGGTAAAG ATTGT AC C AG AGATTGACCC TAGTCAAATT GATAGTCGGG TACAAATTGA 240 

AAATGTCATG GTGTCAGATA AAGAAGTGTC TATTAC GAGT GACCAAGAGA CATTGGATAG 3 00 

AATTGATAAG ATTATCGCTG TTTTGCCAAC TAGCGAACGT ATAACAGGTA ATTACAGTGG 3 60 

TTCAGTACCT TTGCAGGCAA TCGACCGCAA TGGTGTTGTC TTACCGGCAG TTATCACTCC 42 0 

GTTTGATACA ATAATGAAGG TGACTACAAA ACCAGTAGCA CCAAGTTCAA GCACATCAAA 48 0 

TTCAAGTACA AGCAGTTCAT CGGAGACATC TTCGTCAACG AAAGCAACTA GTTCAAAAAC 54 0 

GAAT 544 
(2) INFORMATION. FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids. 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

* 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Ser Leu Ala "Ser Glu Met Gin Glu Ser Thr Arg Lys Phe Lys Val Thr 
1 5 10 15 

Ala Asp Leu Thr Asp Ala Gly Val Giy Thr lie Glu Val Pro Leu Ser 
20 25 30 



-Ile-Glu-Asp Leu Pro Asn Gly Leu Thr Ala Val Ala Thr Pro Gin Lys 
3 5 .40 45 

lie Thr Val Lys lie Gly Lys Lys Ala Gin Lys Asp Lys Val Lys lie 
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50 . " 55 60 

Val Pro Glu lie Asp Pro Ser Gin lie Asp Ser Arg Val Gin lie Glu' 
-65 70 75 80 

Asn Val Met Val Ser Asp Lys Glu Val Ser lie Thr Ser Asp Gin Glu 
,85 90 95 

Thr Leu Asp Arg lie Asp Lys lie lie Ala Val Leu Pro Thr Ser Glu 
100 105 110 

Arg lie Thr Gly Asn Tyr Ser Gly Ser Val Pro Leu Gin Ala He Asp 
115 120 125 

Arg Asn Gly Val Val Leu Pro Ala Val He Thr Pro Phe Asp Thr He 
130' 135 140 

Met Lys Val Thr Thr Lys Pro Val Ala Pro Ser , Ser Ser Thr Ser Asn 
145 * 150 155 160 

Ser Ser Thr Ser Ser Ser Ser Glu Thr Ser Ser Ser Thr Lys Ala Thr 
165 170 175 

Ser Ser Lys Thr Asn 
180 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
■'•(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GCACCAGATG GGGCACAAGG TTCAGGGATC AGATGTTGAA AAGTACTACT TTACCCAACG ' 60 

CGGTCTTGAG CAGGCAGGAA TTACCATTCT TCCTTTTGAT GAAAAAAATC TAGACGGTGA 120 

TATGGAAATT ATCGCTGGAA ATGCCTTTCG TCCAGATAAC AACGTCGAAA TTGCCTATGC 18 0 

. GGACCAAAAT GGTATCAGCT ACAAACGTTA CCATGAGTTT CTAGGTAGCT TTATGCGTGA 240 

CTTTGTTAGC ATGGGAGTAG CAGGAGCACA TGGAAAAACT TCAAC GAC AG GTATGTTGTC 300 

TCATGTCTTG TCTCACATTA CAGATACCAG CTTCTTGATT GGAGATGGGA CAGGTCGTGG 360 

TTCGGCCAAT GCCAAATATT TTGTCTTTGA ATCTGAC GAA TATGAGCGTC ACTTCATGCC 420 

TTACCACCCA GAATACTCTA TTATCACCAA CATTGACTTT GACCATCCAG ATTATTTCAC .480 

AAGTCTCGAG GATGTTTTTA ATGCCTTTAA C GACTATGC C AAACAAATCA CCAAGGGTCT 540 

■ TTTTGTCTAT GGTGAAGATG CTGAATTGCG TAAGATTACG TCTGATGCAC CAATTTATTA 600 

TTATGGTTTT GAAGCTGAAG GCAATGACTT TGTAGC TAGT GATCTTCTTC GTTCAATAAC 660 

— TGGTTCAACC TTCACCGTTC ATTTCCGTGG ACAAAACTTG GGGCAATTCC ACATTCCAAC 720 

CTTTGGTCGT CACAATATCA TGAATGCGAC AGCCGTTATT GGTCTTCTTT ACACAGCAGG 780 
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ATTTGATTTG AACTTGGTGC GTGAGCACTT GAAAACATTT GCCGGTGTTA AAC GTCGTTT 84 0 

CACTGAGAAA ATTGTCAATG ATACAGTGAT TATCGATGAC TTTGCCCACC ATCCAACAGA 900 

AATTATTGCG ACCTTGGATG CGGCTCGTCA GAAATACCCA AGCAAGGAAA TTGTAGCAGT 960 

CTTTCAACCG CATACCTTTA CAAGAACCAT TGCCTTGTTG GACGACTTTG CCCATGCTTT 102 0 
AAACCAAGCA GATGCTGTTT ATCTAGCGCA AATTTATGGC TCGGCTCGTG AAGTAGATCA • 108 0 

TGGTG AC GTT AAGGTAGAAG ACCTAGCCAA CAAAATCAAC AAAAAACACC AAGTGATTAC 1140 

TGTTGAAAAT GTTTC TC C AC TCCTAGACCA TGACAATGCT GTTTACGTCT TTATGGGAGC 1200 

AGGAGACATC CAAACCTATG AATACTCATT TGAGCGTCTC TTGTCTAACT TGACAAGCAA 1260 

TGTTCAA • 1267 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

His Gin Met Gly His Lys Val Gin Gly Ser Asp Val Glu Lys Tyr Tyr 
1 5 10 15 

Phe Thr Gin Arg Gly Leu Glu Gin Ala Gly lie Thr lie Leu Pro Phe 
20 25 30 . 

Asp Glu Lys Asn Leu Asp Gly Asp Met Glu lie lie Ala Gly Asn Ala 
35 40 45 

Phe Arg Pro Asp Asn Asn Val Glu lie Ala Tyr Ala Asp Gin Asn Gly 
50 55 • 60 

lie Ser Tyr Lys Arg Tyr His Glu Phe Leu Gly Ser Phe Met Arg Asp 
65 70 75 80 

Phe Val Ser Met Gly Val Ala Gly Ala His Gly Lys Thr Ser Thr Thr 
85 90 95 

Gly Met Leu Ser His Val Leu Ser His lie Thr Asp Thr Ser Phe Leu 
100 105 ' 110 

lie Gly Asp Gly Thr Gly Arg Gly Ser Ala Asn Ala Lys Tyr Phe Val . 
115 120 125 

Phe Glu Ser Asp Glu Tyr Glu Arg His Phe Met Pro Tyr His Pro Glu 
_ 1?0 135 140 

_ Tyr Ser -lie lie . Thr Asn lie Asp— Phe-Asp-H-i-s— Pro Asp Tyr Phe Thr 
145 150 155 160 . 

Ser Leu Glu Asp Val Phe Asn Ala Phe Asn Asp Tyr Ala Lys Gin lie 
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165 170 175 

Thr Lys Gly Leu Phe Val Tyr Gly Glu Asp Ala Glu Leu Arg Lys lie 
180- 185 190 

Thr Ser Asp Ala Pro lie Tyr Tyr Tyr Gly Phe Glu Ala Glu Gly Asn 

195 . _ ... 200 • 205 

Asp Phe Val Ala Ser Asp Leu Leu Arg Ser lie Thr Gly Ser Thr. Phe 
- 210 215 220 

Thr Val His Phe Arg Gly Gin Asn Leu Gly Gin Phe His lie Pro Thr 
225 230 235 240 

Phe Gly Arg His Asn "lie Met Asn Ala Thr Ala Val lie Gly Leu Leu 
245 . 250 255 

Tyr Thr Ala Gly Phe Asp Leu Asn Leu Val Arg Glu His Leu Lys Thr 
260 _ 265 ■ 270 

Phe Ala Gly Val - Lys" Arg~Arg Phe Thr Glu Lys lie Val Asn Asp Thr 
275 280 285 

Val lie lie Asp Asp Phe Ala His His Pro Thr Glu lie lie Ala Thr 
290 295 . 300 

Leu Asp Ala Ala Arg Gin Lys Tyr Pro Ser Lys Glu lie Val Ala Val 
305 310 315 320 

Phe Gin Pro His Thr Phe Thr Arg Thr lie Ala Leu Leu Asp Asp Phe 
325. 330 335 

Ala His Ala Leu Asn Gin Ala Asp Ala Val Tyr Leu Ala Gin lie Tyr 
340 , 345 350 

Gly Ser Ala Arg Glu Val" Asp His Gly Asp* Val Lys Val Glu Asp Leu 
355 360 365 

Ala Asn Lys lie Asn Lys Lys His > Gin Val lie Thr Val Glu Asn Val 
370 375 380 

Ser Pro Leu Leu Asp His Asp Asn Ala Val Tyr Val Phe Met Gly Ala 
385 390 395. 400 

Gly Asp lie Gin Thr Tyr Glu Tyr Ser Phe Glu Arg Leu Leu Ser Asn 
405 410' 415 

Leu Thr Ser Asn Val Gin - 
420 

(2) INFORMATION FOR SEQ ID NO: 117: 

{ i ) SEQUENCE . CHARACTERISTICS : 

(A) LENGTH: 3121 base pairs 

(B) TYPE: nucleic acid 
"(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
TTTTAACCCA ACTGTTGGTA CTTTCCTTTT TACTGCAGGA TTGAGCTTGT TAGTTTTATT 
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GGTTTCTAAA AGGGAAAATG GAAAGAAACG AC TTGTTC AT TTTCTGCTGT TGACTAGCAT 12 0 

GGGAGTTCAA TTGTTGCCGG CCAGTGCTTT TGGGTTGACC - AGCCAGATTT TATCTGCCTA 180 

TAATAGTCAG CTTTCTATCG GAGTCGGGGA ACATTTACCA GAGCCTCTGA AAATCGAAGG 240 

TTATCAATAT ATTGGTTATA TCAAAACTAA GAAACAGGAT AATACAGAGC TTTCAAGGAC 3 00 

AGTTGATGGG AAATACTCTG CTCAAAGAGA TAGTCAACCA AACTCTACAA AAACATCAGA 3 60 

TGTAGTTCAT TCAGCTGATT TAGAATGGAA CCAAGGACAG GGGAAGGTTA GTTTACAAGG 420' 

TGAAGCATCA GGGGATGATG G AC TTTC AG A AAAATCTTCT ATAGCAGCAG ACAATCTATC 480 

TTCTAATGAT TCATTCGCAA GTCAAGTTGA GCAGAATCCG GATCACAAAG GAGAATCTGT 540 

AGTTCGACCA ACAGTGCCAG AACAAGGAAA TCCTGTGTCT GCTACAACGG TGCAGAGTGC 600 

GGAAGAGGAA GTATTGGCGA CGACAAATGA TCGACCAGAG TATAAACTTC CATTGGAAAC 660 

CAAAGGCACG CAAGAACCCG GTCATGAGGG TGAAGCCGCA GTCCGTGAAG ACTTACCAGT 720 

CTACACTAAG C C ACT AGAAA CCAAAGGTAC ACAAGGACCC GGACATGAAG GTGAAGCTGC 780 

AGTTCGCGAG G AAGAAC C AG CTTACACAGA ACCGTTAGCA AC GAAAGGCA CGCAAGAGCC 840 

AGGTCATGAG GGCAAAGCTA CAGTCCGCGA AGAGACTCTA GAGTACACGG AACCGGTAGC 900 

GACAAAAGGC ACACAAGAAC. CCGAACATGA GGGCGAaCGG sCAGTAGAAG: AAGAACTTCC 960 

GGCTTTAGAG GTCAC TAC AC GAAATAGAAC GGAAATC C AG AATATTCCTT ATACAACAGA 1020 

AGAAATTCAG GATCCAACAC TTCTGAAAAA TCGTCGTAAG ATTGAACGAC AAGGGCAAGC 1080 

AGGGACACGT ACAATTCAAT ATGAAGACTA CATCGTAAAT GGTAATGTCG TAGAAACTAA - 1140 

AGAAGTGTCA CGAACTGAAG TAGCTCCGGT CAACGAAGTC GTTAAAGTAG GAACACTTGT 1200 

GAAAGTTAAA CCTACAGTAG AAATTACAAA CTTAACAAAA GTTGAGAACA AAAAATCTAT 12 60 

AACTGTAAGT TATAACTTAA TAGACACTAC CTCAGCATAT GTTTCTGCAA AAACGCAAGT 1320 

TTTC C ATGGA GACAAGCTAG TTAAAGAGGT GGATATAGAA AATCCTGCCA AAGAGCAAGT 13 80 

AATATCAGGT TTAGATTACT ACACACCGTA TACAGTTAAA ACACACCTAA CTTATAATTT 1440 

GGGTGAAAAT AATGAGGAAA ATACTGAAAC ATCAACTCAA GATTTCCAAT TAGAGTATAA 1500 
GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA ATGATCGTTA _ _ 1560 

TCGTAGATAT TTAAGTCTAA GTGAAGCGCC GACTGATACG GCTAAATACT TTGTAAAAGT 1620- 

GAAATCAGAT CGCTTCAAAG AAATGT AC CT ACCTGTAAAA TCTATTACAG AAAATAC GG A 1680 

TGGAACGTAT AAAGTGACGG TAGCCGTTGA TCAACTTGTC GAAGAAGGTA CAGACGGTTA 1740 

CAAAGATGAT TACACATTTA CTGTAGCTAA ATCTAAAGCA GAGCAACCAG GAGTTTACAC 1800 

ATC CTTT AAA CAGCTGGTAA CAGCCATGCA AAGCAATCTG TCTGGTGTCT ATACATTGGC . 1860 



TTCAGATATG -ACCGCAGATG AGGTGAGCTT AGGCGATAAG CAGACAAGTT ATCTCACAGG 1920 
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TGCATTTACA GGGAGCTTGA TCGGTTCTGA TGGAACAAAA TCGTATGCCA TTTATGATTT 1980 

GAAGAAACCA TTATTTGATA CATTAAATGG TGCTACAGTT AGAGATTTGG ATATTAAAAC 2040 

TGTTTCTGCT GATAGTAAAG AAAATGTCGC AGCGCTGGCG AAGGCAGCGA ATAGCGCGAA 2100 

TATTAATAAT GTTGCAGTAG AAGGAAAAAT CTCAGGTGCG AAATCTGTTG CGGGATTAGT 2160 

AGC GAGCGC A ACAAATACAG TGATAGAAAA CAGCTCGTTT ACAGGGAAAC TTATCGCAAA 22 2 0 

TCACCAGGAC AGTAATAAAA ATGATACTGG AGGAATAGTA GGTAATATAA CAGGAAATAG 22 80 

TTCGAGAGTT AATAAAGTTA GGGTAGATGC CTTAATCTCT AC TAATGC AC GCAATAATAA 23 40 

CCAAACAGCT GGAGGGATAG TAGGTAGATT AGAAAATGGT GCATTGATAT CTAATTCGGT 2400 

TGCTACTGGA GAAATACGAA ATGGTCAAGG AT ATTC TAG A GTCGGAGGAA TAGTAGGATC 2460 

TACGTGGCAA AACGGTCGAG TAAATAATGT TGTGAGTAAC GTAGATGTTG GAGATGGTTA 2520 

TGTTATCACC GGTGATCAAT ACGCAGCAGC AGATGTGAAA^AATGCAAGTA""CATCAGTTGA~ " 258 0 

TAATAGAAAA GCAGACAGAT TCGCTACAAA ATTATCAAAA GACCAAATAG ACGCGAAAGT 2 640 

TGC TGATTAT GGAATCACAG TAACTCTTGA TGATACTGGG CAAGATTTAA AACGTAATCT 27 00 

AAGAGAAGTT GATTATACAA GACTAAATAA AGCAGAAGCT GAAAGAAAAG TAGCTTATAG 27 60 

CAACATAGAA AAACTGATGC CATTCTACAA TAAAGACCTA GTAGTTCACT ATGGTAACAA 2820 

AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA GATGTTGTGC CGATGAAAGA 2880 

TGATGAAGTA GTAACGGATA TTAATAATAA GAAAAATTCA ATAAATAAAG TTATGTTACA 2940 

TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC AAAGAAAACT TCATAAACAG 3 000 

TCAAGTAATC GAATACAATG TTACAGGAAA 'AGAATATATA TTC AC AC C AG AAGCATTTGT 3 060 

TTCAGACTAT ACAGCGATAA CGAATAACGT AC TAAGCGAC TTGCAAAATG TAACACTTAA 3120 

C 3121 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 amino acids 
'(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 

Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Thr Ala Gly Leu Ser Leu 
1. 5 10 15 . 

Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gly Lys Lys Arg Leu Val 

20 , 25 30 - 

. His Phe Leu Leu Leu Thr- Ser Met Gly Vai Gin Leu Leu Pro Ala Ser 

35 40 .... 45 



Ala Phe Gly Leu Thr Ser. Gin lie Leu Ser Ala Tyr Asn Ser Gin Leu 
50 55 60 



Ser lie Gly Val Gly Glu His Leu Pro Glu Pro Leu Lys lie Glu Gly— 
65 70 75 80 

Tyr Gin Tyr lie Gly Tyr lie Lys Thr Lys Lys Gin Asp Asn Thr Glu 
85 .90 "95 

Leu Ser Arg Thr Val Asp Gly Lys Tyr Ser Ala Gin Arg Asp Ser Gin 
. 100 105 . - 110 

Pro Asn Ser Thr Lys Thr Ser Asp Val Val His Ser Ala Asp Leu Glu 
115 120 125 

Trp Asn Gin Gly Gin Gly Lys Val Ser Leu Gin Gly Glu Ala Ser Gly 
130 - 135 140 

Asp Asp Gly Leu Ser Glu Lys Ser Ser lie Ala Ala Asp Asn Leu Ser 
145 150 155 x " "160 

Ser Asn Asp Ser Phe Ala Ser Gin Val Glu Gin Asn Pro Asp His Lys 
165 170 175 

Gly Glu Ser Val Val Arg Pro Thr Val Pro Glu Gin Gly Asn Pro Val 
, 180 185 190 

Ser Ala Thr Thr Val Gin Ser Ala Glu Glu Glu Val Leu Ala Thr Thr 
195 200 . v 205 

Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Glu Thr Lys Gly Thr Gin 
210 215 _ 220 

Glu Pro Gly His Glu Gly -Glu Ala Ala Val Arg Glu Asp Leu Pro Val 
225 230 235 240 

Tyr Thr Lys Pro Leu Glu Thr Lys Gly Thr Gin Gly Pro Gly His Glu 
245 ' 250 255 

Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Ala Tyr Thr Glu Pro Leu 
260 265 , 270 

Ala Thr Lys. Gly Thr Gin Glu Pro Gly His Glu Gly Lys Ala Thr Val 
275 280 285 

Arg Glu Glu Thr Leu Glu .Tyr Thr Glu Pro Val Ala Thr Lys Gly Thr 
290 295 300 

Gin Glu Pro Glu His Glu Gly Glu Arg Xaa Val Glu Glu Glu Leu Pro 
305 310 V 315 320. 

Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Glu lie Gin Asn lie Pro 
325 330 _ 335 

Tyr Thr Thr Glu Glu lie Gin Asp Pro Thr Leu Leu Lys Asn Arg Arg 
340 345 350 

Lys He Glu Arg Gin Gly Glh Ala Gly Thr Arg Thr He Gin Tyr Glu 
3 55 3 60 3 65 



Asp Tyr He Val Asn Gly Asn Val Val Glu -Thr Lys Glu Val. Ser Arg 
370 375 380 



^ 3 
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Thr Glu Val Ala Pro Val Asn Glu Val Val Lys Val Gly Thr Leu Val 
385 390 395 400 

. Lys Val Lys Pro Thr Val Glu lie Thr Asn Leu Thr Lys Val Glu Asn 
405 410 415 

Lys Lys Ser lie Thr Val Ser Tyr Asn Leu lie Asp Thr Thr Ser Ala 
420 . 425 430- 

Tyr Val Ser Ala Lys Thr Gin Val Phe His Gly Asp Lys- Leu Val Lys 
435- % 440 445 

Glu Val Asp lie Glu Asn Pro Ala Lys Glu Gin Val lie Ser Gly Leu 
450 455 460 

Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr His Leu. Thr Tyr Asn Leu 
465 470 . 475 ^ 480 

Gly Glu Asn Asn Glu Glu Asn Thr Glu Thr Ser Thr Gin Asp Phe Gin 
485 490 495 

Leu Glu Tyr Lys Lys lie Glu lie Lys Asp lie Asp Ser Val Glu Leu 
500 505 510 

Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg Tyr Leu Ser Leu Ser Glu 
515 520 525 

Ala Pro Thr Asp Thr Ala Lys Tyr Phe Val Lys Val Lys Ser Asp Arg 
530 535 540 

Phe Lys Glu Met Tyr Leu Pro Val Lys. Ser lie Thr Glu Asn Thr Asp 
545 550 555 560 

Gly Thr Tyr Lys Val Thr Val Ala Val Asp Gin Leu Val Glu Glu Gly 
565 570 . 575 

Thr Asp Gly Tyr Lys Asp Asp Tyr Thr Phe Thr Val Ala Lys Ser Lys 
580 585 590 

Ala Glu Gin Pro Gly Val Tyr Thr Ser Phe Lys Gin Leu Val Thr Ala 
595 600 605 

Met Gin Ser Asn Leu Ser Gly Val Tyr Thr Leu Ala Ser Asp Met Thr 
610 615 620 

Ala Asp Glu Val Ser Leu Gly Asp Lys Gin Thr Ser Tyr Leu Thr Gly 
625 630 635 640 

Ala Phe Thr Gly Ser Leu lie Gly Ser Asp Gly Thr Lys Ser Tyr Ala 
645 650 v 655 

lie Tyr Asp Leu Lys Lys Pro Leu Phe Asp Thr Leu Asn Gly Ala Thr 
660 665 670 

Val Arg Asp Leu Asp lie Lys Thr Val Ser Ala Asp Ser Lys Glu Asn 
675 680 685 

Val Ala Ala Leu Ala Lys Ala Ala Asn Ser Ala Asn lie Asn Asn Val 
690 69.5 700 

Ala Val Glu Gly Lys He Ser Gly Ala Lys Ser Val Ala Gly Leu~Val 
705 710 715 720 
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Ala Ser Ala Thr Asn Thr Val lie Glu Asn Ser Ser Phe Thr Gly Lys 
725 730 . 735 

Leu lie Ala Asn His Gin Asp Ser Asn - Lys Asn Asp Thr Gly Gly lie 
740 745 750 

Val Gly Asn lie Thr Gly Asn Ser Ser Arg Val Asn Lys Val Arg Val 
755 760 765 

Asp Ala Leu lie Ser Thr Asn Ala Arg Asn Asn Asn Gin Thr Ala Gly 
. 770- 775 780 

Gly lie Val Gly Arg Leu Glu Asn Gly Ala Leu lie Ser Asn Ser Val 
785 790 795 800 

• Ala Thr Gly Glu lie Arg Asn Gly Gin Gly Tyr Ser Arg Val" Gly Gly 
805 810 815 

lie Val Gly Ser Thr Trp Gin. Asn Gly Arg Val Asn Asn Val Val Ser 
820 825 830 

Asn Val Asp Val Gly Asp Gly Tyr Val lie Thr Gly Asp Gin Tyr Ala 
835 840 845 

Ala Ala Asp Val Lys Asn Ala Ser Thr Ser Val Asp Asn Arg Lys Ala 
850 855 860 

Asp Arg Phe Ala Thr Lys Leu Ser Lys Asp Gin lie Asp Ala Lys Val 
865 - 870 ' 875 880 

Ala Asp Tyr Gly lie Thr Val Thr Leu Asp Asp Thr Gly Gin Asp Leu . 

885 890 895 

Lys Arg Asn . Leu Arg Glu Val Asp Tyr Thr Arg Leu Asn Lys Ala Glu 
900 905 ' 910 

Ala Glu Arg Lys Val Ala Tyr Ser Asn lie Glu Lys Leu Met Pro Phe 
915 920 925 

Tyr Asn Lys Asp Leu Val Val His Tyr Gly Asn Lys Val Ala Thr Thr 
930 935 . 940 

Asp Lys Leu Tyr Thr Thr Glu Leu Leu Asp Val Val Pro Met Lys Asp 
- 945 950 955 960 

Asp Glu Val Val Thr Asp lie Asn Asn Lys Lys Asn Ser lie Asn Lys 
965 970 975 

Val Met Leu His Phe Lys Asp Asn Thr Val Glu Tyr Leu Asp Val Thr 
980 985 990 

Phe Lys Glu Asn Phe lie Asn Ser Gin Val lie Glu Tyr Asn Val Thr 
995 1000 1005- 

Gly Lys Glu Tyr lie Phe Thr Pro Glu Ala Phe Val Ser Asp Tyr Thr 
1010 1015 1020 

Ala lie Thr Asn Asn Val Leu Ser Asp Leu Gin Asn Val Thr Leu Asn 
~ — 1025 1030 1035 1040 

(2) INFORMATION FOR SEQ ID NO: 119: 
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( i ) SEQUENCE- CHARACTERISTICS : 

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

TTTTAAC C C A ACTGTTGGTA CTTTCCTTTT TACTGCAGGA TTGAGCTTGT TAGTTTTATT 60 

GGTTTCTAAA AGGGAAAATG GAAAGAAACG ACTTGTTCAT TTTCTGC TGT TGACTAGCAT 120 

GGGAGTTCAA TTGTTGCCGG CCAGTGCTTT TGGGTTGACC AGCCAGATTT TATCTGCCTA 18 0 

TAATAGTCAG CTTTCTATCG GAGTCGGGGA AC ATTT AC C A GAGC CTCTGA AAATCGAAGG 24 0 

TTATCAATAT ATTGGTTATA TCAAAACTAA GAAAC AGGAT AATACAGAGC TTTC AAGGAC 3 00 

AGTTGAT GGG AAATACTCTG CTCAAAGAGA TAGTCAACCA AACTCTACAA AAACATCAGA 3 60 

TGTAGTTCAT TCAGCTGATT TAGAATGGAA C C AAGGAC AG GGGAAGG TT A GTTTACAAGG 42 0 

TGAAGCATCA GGGGATG ATG GACTTTCAGA AAAATCTTCT ATAGCAGCAG ACAATCTATC 48 0 

TTCTAATGAT TCATTCGCAA GTCAAGTTGA GCAGAATCCG GATCACAAAG GAGAATCTGT 540 

AGTTCGACCA AC AGTGC C AG AACAAGGAAA TCCTGTGTCT GCTACAACGG TGCAGAGTGC 600 

GGAAGAGGAA GTATTGGCGA CGACAAATGA TCGACCAGAG TATAAAC TTC CATTGGAAAC 660 

CAAAGGCACG CAAGAACCCG GTCATGAGGG TGAAGC CGC A GTCC GTGAAG ACTTACCAGT 720 

CTACACTAAG CCACTAGAAA C C AAAGGT AC ACAAGGACCC. GGACATGAAG GTGAAGCTGC 780 

AGTTC GC GAG GAAGAAC C AG C TT AC AC AG A ACCGTTAGCA ACGAAAGGCA CGCAAGAGCC 840 

AGGTCATGAG GGCAAAGCTA CAGTCCGCGA AGAGACTCTA GAGTAC AC GG AACCGGTAGC 900 

GAC AAAAGGC ACACAAGAAC CCGAACATGA GGGCGAaCGG sCAGTAGAAG AAGAACTTCC 960 

GGC TTTAGAG GTCACTACAC GAAATAGAAC GGAAATCCAG AATATTC C TT ATACAACAGA 1020 

AGAAATTCAG GATCCAACAC TTC TGAAAAA TCGTCGTAAG ATTGAACGAC AAGGGCAAGC 1080 

AGGGACACGT ACAATTCAAT ATG AAGAC T A CATCGTAAAT GGTAATGTCG TAGAAACTAA 1140 

AGAAGTGTCA CGAACTGAAG TAGCTCCGGT CAACGAAGTC GTTAAAGTAG GAACACTTGT 120 0 

GAAAGTTAAA CCTACAGTAG AAATTACAAA CTTAACAAAA GTTGAGAACA AAAAATCTAT 1260 

AAC TGTAAGT TATAACTTAA TAG AC AC T AC CTCAGCATAT GTTTCTGCAA AAACGCAAGT 1320 
TTTCCATGGA GACAAGCTAG TTAAAGAGGT GGATATAGAA AATCCTGCCA AAGAGCAAGT ' 1380 

AATATCAGGT TTAGATTACT ACACACCGTA TACAGTTAAA ACACACCTAA CTTATAATTT 1440 

GGGTGAAAAT AATGAGGAAA ATACTGAAAC ATCAACTCAA GATTTCCAAT TAGAGTATAA 1500 

GAAAATAGAG ATTAAAGATA TTGATTCAGT AGAATTATAC GGTAAAGAAA ATGATCGTTA 1560 

TCGTAGA . 1567 
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(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) "LENGTH: 522 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Phe Asn Pro Thr Val Gly Thr Phe Leu Phe Thr Ala Gly Leu Ser Leu 
1 5 10 ^ 15 

Leu Val Leu Leu Val Ser Lys Arg Glu Asn Gly Lys Lys Arg Leu Val 
.20 25 30 



"ay ' ' His Phe Leu Leu Leu Thr Ser Met Gly Val Gin Leu Leu Pro Ala Ser 

Sj. -35 . 40 45 

Ala Phe Gly Leu Thr Ser Gin lie Leu Ser Ala Tyr Asn Ser Gin Leu 

50 55 60 

'.^ Ser lie Gly Val Gly Glu His Leu Pro Glu Pro Leu Lys lie Glu Gly 

H 65. 70 75 80 

O Tyr Gin Tyr lie Gly Tyr lie Lys Thr Lys Lys Gin Asp Asn Thr Glu 

i=~ 85 . - 90 95 

Leu Ser Arg Thr Val Asp Gly Lys Tyr - Ser Ala Gin Arg Asp Ser Gin 

.ji ' 100 105 110 

^ Pro Asn Ser Thr Lys Thr Ser Asp Val Val His Ser Ala Asp Leu Glu 

115 120 ; . 125 

Trp Asn Gin Gly Gin Gly Lys Val Ser Leu Gin Gly Glu Ala Ser Gly 

• 130 135: , 140 

Asp Asp Gly Leu Ser Glu Lys Ser Ser lie Ala Ala Asp Asn Leu Ser 

145 150 155 160 

Ser Asn Asp Ser Phe Ala Ser Gin Val Glu Gin Asn Pro Asp His Lys 

- 165- 170 175 

Gly Glu Ser Val Val Arg Pro Thr Val Pro Glu Gin Gly Asn Pro Val 

180 185 ' 190 

.Ser Ala Thr Thr Val Gin Ser Ala Glu Glu Glu Val Leu Ala Thr Thr 

195 , 200 205 

Asn Asp Arg Pro Glu Tyr Lys Leu Pro Leu Glu Thr Lys Gly Thr Gin 

210 215 220 

Glu Pro Gly His Glu Gly Glu Ala Ala Val Arg Glu Asp Leu Pro Val 

225 - 230 " 235 240 

Tyr Thr Lys Pro Leu-Gl-u— 1 Thr— Lys-Gly Thr Gin' Gly Pro Gly His Glu 

- 245 250 255 

Gly Glu Ala Ala Val Arg Glu Glu Glu Pro Ala Tyr Thr Glu Pro Leu 
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. 260 265 270 

• Ala Thr Lys Gly Thr Gin Glu Pro . Gly His Glu Gly Lys Ala Thr Val" 

275 280 285 

Arg Glu Glu Thr Leu Glu Tyr Thr Glu Pro Val Ala Thr Lys Gly Thr 
290 . . 295 300 

Gin Glu Pro Glu His Glu Gly Glu Arg Xaa Val Glu Glu Glu Leu Pro 
. 305 . 310 315 320 

Ala Leu Glu Val Thr Thr Arg Asn Arg Thr Glu lie Gin Asn lie Pro 
325 330 335 

Tyr Thr Thr Glu Glu lie Gin Asp Pro Thr Leu Leu Lys Asn Arg Arg 
340 345 350 

* Lys lie Glu Arg Gin Gly Gin Ala Gly Thr Arg Thr lie Gin Tyr Glu 

355 3'60 365 

'Asp~~Tyr~Il"e""VaT"Asn Gly Asn Val Val Glu Thr Lys Glu Val Ser Arg 
370 375 380 

Thr Glu Val Ala Pro Val Asn Glu Val Val Lys Val Gly Thr Leu Val 
385 390 395 400 

Lys Val Lys Pro Thr Val Glu lie Thr Asn Leu Thr Lys Val Glu Asn 
405 410 415 

Lys Lys Ser lie Thr Val Ser Tyr Asn Leu lie Asp Thr Thr Ser Ala 
420 425 * 430 

Tyr Val Ser Ala Lys Thr Gin Val Phe His Gly Asp Lys Leu Val Lys 
435 440 445 

Glu Val Asp lie Glu Asn Pro Ala Lys Glu Gin Val lie Ser Gly Leu 
450 . 455 460 

Asp Tyr Tyr Thr Pro Tyr Thr Val Lys Thr His Leu Thr Tyr Asn Leu 
465 470 475 480 

Gly Glu Asn Asn Glu Glu Asn . Thr Glu Thr Ser Thr Gin Asp Phe Gin 
485 490 495 

Leu Glu Tyr Lys Lys lie Glu lie Lys Asp lie Asp Ser Val Glu Leu 
V. 500 505 510 

Tyr Gly Lys Glu Asn Asp Arg Tyr Arg Arg 
515 520 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID . NO : 121: 

TCGTAGATAT TTAAGTCTAA GTGAAGCGCC GAC TGATACG GCTAAATACT TTGTAAAAGT 
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GAAATCAGAT CGCTTCAAAG AAATGTACCT ACCTGTAAAA 
TGGAACGTAT AAAGTGACGG TAGCCGTTGA TCAACTTGTC 
CAAAGATGAT TACACATTTA CTGTAGCTAA ATCTAAAGCA 
ATC C TTT AAA CAGCTGGTAA CAGCCATGCA AAGCAATCTG 
TTCAGATATG ACCGCAGATG AGGTGAGCTT AGGC GATAAG 
TGCATTTACA GGGAGCTTGA TCGGTTCTGA TGGAAC AAAA 
GAAGAAACCA TTATTTGATA CATTAAATGG TGCTACAGTT 
TGTTTCTGCT GATAGTAAAG AAAATGTCGC AGCGCTGGCG 
TATTAATAAT GTTGCAGTAG ■ AAGGAAAAAT CTCAGGTGCG 
AGCGAGCGCA ACAAATACAG TGATAGAAAA AGCTC GTTT 
TCACCAGGAC AGTAATAAAA ATGATACTGG AGGAATAGTA 
TTCGAGAGTT AATAAAGTTA GGGTAGATGC CTTAATCTCT 
CCAAACAGCT GGAGGGATAG TAGGTAGATT AGAAAATGGT 
TGCTACTGGA GAAATACGAA ATGGTCAAGG ATATTCTAGA 
TACGTGGCAA AACGGTCGAG TAAATAATGT ■ TGTGAGTAAC 
TGTTATCACC GGTGATCAAT ACGCAGCAGC AGATGTGAAA 
TAATAGAAAA GCAGACAGAT TCGCTACAAA ATT ATC AAAA 
TGCTGATTAT GGAATCACAG TAAC TCTTGA* TGATAC TGGG 
AAGAGAAGTT GATTATACAA GACTAAATAA AGCAGAAGCT 
CAACATAGAA AAACTGATGC CATTCTACAA TAAAGACCTA 
AGTAGCGACA ACAGATAAAC TTTACACTAC AGAATTGTTA 
TGATGAAGTA GTAAC GGATA TTAATAATAA GAAAAATTCA 
TTTCAAAGAT AATACAGTAG AATACCTAGA TGTAACATTC 
TCAAGTAATC GAATACAATG TTACAGGAAA AGAATATATA 
TTCAGACTAT ACAGCGATAA CGAATAACGT ACTAAGCGAC 
C 

(2) INFORMATION FOR SEQ ID NO: 122: 

. ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid; 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear. 



TCTATTACAG AAAAT AC GG A ■ 12 0 

GAAGAAGGTA CAGACGGTTA 18 0 

GAGCAACCAG GAGTTTACAC 24 0 

TCTGGTGTCT ATACATTGGC 3 00 

CAGACAAGTT ATCTCACAGG 360 

TCGTATGCCA TTTATGATTT 42 0 

AGAGATTTGG ATATTAAAAC 48 0 

AAGGCAGCGA ATAGCGCGAA 540 

AAATCTGTTG CGGGATTAGT 600 

ACAGGGAAAC TTATCGCAAA 660 

GGTAATATAA CAGGAAATAG 72 0 

ACTAATGCAC GCAATAATAA 78 0 

GCATTGATAT CTAATTCGGT 840 

GTCGGAGGAA TAGTAGGATC 900 

GTAGATGTTG GAGATGGTTA 960 

AATGCAAGTA CATCAGTTGA 1020 

GACCAAATAG AC GC GAAAGT 1080 
CAAGATTTAA AACGTAATCT . 1140 

GAAAGAAAAG TAGCTTATAG 1200 

GTAGTTCACT ATGGTAACAA 12 60 

GATGTTGTGC CGATGAAAGA 132 0 

ATAAATAAAG TTATGTTAC A 138 0 

AAAGAAAACT TCATAAACAG 1440 

TTCACACCAG AAGCATTTGT 1500 

TTGCAAAATG TAAC AC TT AA 1560 

1561 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

Arg Arg Tyr Leu Ser Leu Ser Glu Ala Pro Thr Asp Thr Ala Lys Tyr 
1 5 10 . 15 

Phe Val Lys Val Lys Ser_.Asp Arg Phe Lys Glu Met Tyr Leu Pro Val 
20 25 30 

•Lys-, Ser lie Thr Glu Asn Thr Asp Gly Thr Tyr Lys Val Thr Val Ala 

35 40 . . - 45 

Val Asp Gin Leu Val Glu Glu Gly Thr Asp Gly Tyr Lys Asp Asp Tyr 
50 55 60 

Thr Phe Thr Val Ala Lys Ser Lys Ala Glu Gin Pro Gly Val Tyr Thr 
65 70 .75 80 

Ser Phe Lys Gin Leu Val Thr Ala Met Gin Ser Asn Leu Ser Gly Val 

85 90^ 95 

Tyr Thr Leu Ala Ser Asp Met Thr Ala Asp Glu Val Ser Leu Gly Asp 
100 105 110 

Lys Gin Thr Ser Tyr Leu Thr Gly Ala Phe Thr Gly Ser Leu lie Gly 
115 120 125 

Ser Asp Gly Thr Lys Ser Tyr Ala lie Tyr Asp Leu Lys Lys Pro Leu 
130 135 140 

Phe Asp Thr Leu Asn Gly Ala Thr Val Arg Asp Leu Asp lie. Lys Thr 
145 150 1^55 160 

Val Ser Ala Asp Ser Lys Glu Asn Val "Ala Ala Leu Ala Lys Ala Ala 
165 170 175 

Asn Ser Ala Asn lie Asn Asn Val Ala Val Glu' Gly Lys lie Ser Gly 
180 185 190 

Ala Lys Ser Val Ala -Gly Leu Val Ala Ser Ala Thr Ash Thr Val He 
195 ' . 200 205 

Glu Asn Ser Ser Phe Thr Gly Lys Leu He Ala Asn His Gin Asp Ser 
210 215 220 

Asn Lys Asn Asp Thr Gly Gly He Val Gly Asn He Thr Gly Asn Ser 
225 230 235 240 

Ser Arg Val- Asn Lys Val Arg Val Asp Ala Leu He Ser Thr Asn Ala 

245 250 . - 255. 

Arg Asn Asn Asn Gin Thr Ala Gly Gly lie Val Gly Arg Leu Glu Asn 
260 265 " 270 

Gly Ala Leu He Ser Asn Ser Val Ala Thr Gly Glu He Arg Asn Gly 
275 - 280 285 

Gin Gly Tyr Ser Arg Val Gly Gly He Val Gly.. Ser Thr Trp Gin Asn 
290 295 . 300 



Gly Arg Val Asn Asn Val Val Ser-Asn Val Asp Val Gly Asp Gly Tyr 
305 310- 315 - 320- 
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Val lie Thr Gly Asp Gin Tyr Ala Ala Ala Asp Val Lys' Asn Ala Ser 
325 330 335 

Thr Ser Val Asp Asn Arg Lys Ala Asp Arg Phe Ala Thr Lys Leu Ser - 
340 345 —350 

Lys Asp Gin lie Asp Ala Lys Val Ala Asp Tyr Gly lie Thr Val Thr 
355 360 365 

Leu Asp Asp Thr Gly Gin Asp Leu Lys Arg Asn Leu Arg Glu Val Asp 
370 ' 375 380 

Tyr Thr Arg Leu Asn Lys Ala Glu Ala Glu Arg Lys Val Ala Tyr Ser 
385 390 395 400. 

Asn lie Glu Lys Leu Met Pro Phe Tyr Asn Lys Asp Leu Val Val His 
405 410 . 415 

Tyr Gly Asn Lys Val Ala Thr Thr Asp Lys Leu Tyr Thr Thr Glu Leu 
420 425 430 



Leu Asp Val Val Pro Met Lys Asp Asp Glu Val Val Thr Asp lie Asn 
435 440 445 

Asn Lys Lys Asn Ser lie Asn Lys Val Met Leu His Phe Lys Asp Asn 
450 f 455 460 

Thr Val Glu Tyr Leu Asp Val Thr Phe Lys Glu Asn Phe lie Asn Ser 
465 470 . 475 480 

Gin Val lie Glu Tyr Asn Val Thr Gly Lys Glu Tyr lie Phe Thr Pro 
485 490 495 



Glu Ala Phe. Val Ser Asp Tyr Thr Ala lie Thr Asn Asn Val Leu Ser 
500 505 510 



Asp Leu Gin Asn Val Thr Leu Asn 
515 . 520 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 850 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : doubie 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CTTTGGTTTT GAAGGAAGTA AGC GTGGAC A ATTTGC TGTA GAAGGAATCA ATCAACTTCG 6.0 

TGAGCATGTA GACACTCTAT TGATTATCTC AAACAACAAT TTGCTTGAAA TTGTTGATAA 120 

GAAAACACCG CTTTTGGAGG CTCTTAGC GA AGCGGATAAC GTTC TTCGTC AAGGTGTTC A 18 0 

AGGGATTACC GATTTGATTA CCAATCCAGG ATTGATTAAC CTTGACTTTG CCGATGTGAAJ 240 

AACGGTAATG GCAAACAAAG GGAATGCTCT TATGGGTATT GGTATCGGTA GTGGAGAAGA 3 00 

ACGTGTGGTA GAAGCGGCAC GTAAGGCAAT CTATTCACCA CTTCTTGAAA CAACTATTGA ^ 3 60 
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CGGTGCTGAG GATGTTATCG TCAACGTTAC TGGTGGTCTT GACTTAACCT TGATTGAGGC 42 Q 

AGAAGAGGCT TCACAAATTG TGAACCAGGC AGCAGGTCAA GGAGTGAACA TCTGGCTCGG 480 

TACTTCAATT GATGAAAGTA TGC GTGATGA AATTCGTGTA ACAGTTGTTG CAACGGGTGT 54 0- 

TCGTCAAGAC CGCGTAGAAA AGGTTGTGGC TCCACAAGCT AGATC TGCTA CTAACTACCG 600 

" TGAGACAGTG AAACCAGCTC ATTCACATGG CTTTGATCGT CATTTTGATA TGGCAGAAAC 660 

AGTTGAATTG CCAAAACAAA ATCCACGTCG TTTGGAACCA ACTCAGGCAT CTGCTTTTGG - 720 

TGATTGGGAT CTTCGCCGTG AATCGATTGT TCGTACAACA GATTCAGTCG TTTCTCCAGT 78 0 

CGAGCGCTTT GAAGCCCCAA TTTCACAAGA TGAAGATGAA TTGGATACAC CTCCATTTTT 840 

CAAAAATCGT 850 
(2) INFORMATION FOR SEQ ID NO: 124:' 

(i) SEQUENCE CHARACTERISTICS : - 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Phe Gly Phe Glu Gly Ser Lys Arg Gly Gin Phe Ala Val Glu Gly lie 
1 5 10 15 

Asn Gin Leu Arg Glu His Val Asp Thr Leu Leu lie lie Ser Asn Asn 
20 25 30 

Asn Leu Leu Glu He Val Asp Lys Lys Thr Pro Leu Leu Glu Ala Leu 
35 40 45 

Ser Glu Ala Asp Asn Val Leu Arg Gin Gly Val Gin Gly He Thr Asp 
50 .55 60 

Leu lie Thr Asn Pro Gly Leu lie Asn Leu Asp Phe Ala Asp Val Lys 
65 70 75 ' 80 • 

Thr Val Met Ala Asn Lys Gly Asn Ala Leu Met Gly He Gly He Gly 
85 90 95 

Ser Gly Glu Glu Arg Val Val Glu Ala Ala Arg Lys Ala He Tyr Ser 
100 105 110 . 

Pro Leu Leu Glu Thr Thr He Asp Gly Ala Glu Asp Val lie Val Asn 
115 120 12-5 . 

Val Thr Gly Gly Leu Asp Leu Thr Leu He Glu Ala Glu Glu Ala Ser 
130 135 140 

Gin He Val Asn Gin Ala Ala Gly Gin Gly Val Asn He Trp Leu Gly 
145 150 155 160. 



Thr Ser He Asp Glu Ser Met Arg Asp Glu lie- Arg- Val Thr Val Val 

165 ■ 170 ...JL75 
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Ala Thr Gly Val Arg Gin Asp Arg Val Glu Lys Val Val Ala Pro Gin 
180 • 185 190 

Ala Arg Ser Ala Thr Asn Tyr Arg Glu Thr -Val Lys Pro Ala His Ser 
195 200 205 

His Gly Phe Asp Arg His Phe Asp Met Ala Glu Thr Val Glu Leu _ Pro 
210 ' 215 220 

Lys Gin Asn Pro Arg Arg Leu Glu Pro Thr Gin Ala Ser Ala Phe Gly 
225 230 235 > 240 

Asp Trp Asp Leu Arg Arg Glu Ser lie Val Arg Thr Thr Asp Ser Val 
245 250 255 

Val Ser Pro Val Glu Arg Phe Glu Ala Pro lie Ser Gin Asp Glu Asp 
260 . 265 270 

Glu Leu Asp Thr Pro Pro Phe Phe Lys Asn Arg 
275 280 

(2) INFORMATION FOR SEQ ID NO : 12 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) -LENGTH: 1051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

CTACTACCTC TCGAGAGAAA GTGAC CTAGA GGTGACCGTT TTTGACCATG AGCAAGGTCA 60 

AGCCACCAAG GCCGCAGCAG GAATTATCAG TCCTTGGTTT TCCAAACGCC GTAATAAAGC 12 0 

CTGGTACAAG ATGGCGCGCT TGGGGGCTGA TTTTTATGTG GATTTATTAG CTGATTTAGA 18 0 

GAAATCAGGA CAAGAAATCG ACTTTTACCA GCGTTCGGGA GTCTTTCTCT TGAAAAAGGA 240 

TGAATCCAAT TTGGAAGAAC TTTATCAACT GGCCCTCCAG CGCAGAGAAG AATCTCCCTT 3 00 

GATAGGGCAA TTAGC C ATTC TGAACCAAGC CTCAGCTAAT GAATTATTCC CTGGTTTGCA 3 60 

GGGATTTGAC CGCCTGCTCT ATGCTTCTGG TGGAGCGAGA GTAGATGGCC AACTTTTAGT 420 

GACTCGTTTG CTGGAAGTCA GTCATGTCAA GCTGGTCAAA GAAAAAGTGA CTCTGACACC 480 

GTTAGCATC A GGCTACCAGA TTGGTGAAGA GGAGTTTGAG CAGGTTATTT TGGCGACGGG 540 

AGCTTGGTTG GGGGACATGT TAGAGCCTTT AGGTTATGAA GTGGATGTCC GTCCTCAAAA 600 

AGGACAACTA CGAGATTATC AGCTTGCCCA AGACATGGAA GATTACCCTG TTGTCATGCC 660 

AGAAGGGGAG TGGGATTTGA TTCCCTTTGC AGGTGGGAAA TTATCCTTAG GCGCTACCCA 720 

C G AAAATGAC ATGGGATTTG ATTTGACGGT AGATGAAACC TTGCTCCAAC AAATGGAGGA 78 0 

GGCCACCTTG ACTCACTATC - TGATTTTGGC TGAAGC TACT TCAAAATCTG AGCGTGTTGG .840 

AATCCGTGCC T AC AC C AGTG ATTTCTCTCC TTTC TTTGGG CAGGTGCCTG AC TT AACTGG 900 
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TGTCTATGCA GCCAGTGGAC TAGGTTCATC AGGCCTCACA ACTGGTCCTA TCATTGGTTA ■ .960 

CCATCTAGCC CAACTGATCC AAGACAAGGA GTTGACCTTG GACCCTCTAA ATTACCCAAT .1020 

TGAAAACTAT GTCAAACGAG TAAAAAGCGA A . 1051 
(2) INFORMATION FOR SEQ ID NO:126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 50 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(iij MOLECULE TYPE: protein' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 6: 

hQ Tyr Tyr Leu Ser Arg Glu Ser Asp Leu Glu Val Thr Val Phe Asp His 

%j 1 5 10 15 

i"f% Glu Gin Gly Gin Ala Thr Lys Ala Ala Ala Gly lie lie Ser Pro Trp 

J 20- 25 3 0 

^ Phe Ser Lys Arg Arg Asn Lys Ala Trp Tyr Lys Met Ala Arg Leu Gly 

H 35 40 45^ 

O Ala Asp Phe Tyr Val Asp Leu Leu Ala Asp Leu Glu Lys Ser Gly Gin 

U 50 55 60 

ITU ■ ■ ' 

pi Glu lie Asp Phe Tyr Gin Arg Ser Gly Val Phe Leu Leu Lys Lys Asp 

65 70 75 80 

"isd - 

H -Glu Ser Asn Leu Glu Glu Leu Tyr Gin Leu Ala Leu Gin Arg Arg Glu 

85 90 95 

Glu Ser Pro Leu lie Gly Gin Leu Ala lie Leu Asn Gin Ala Ser Ala 
100 105 110 

Asn Glu Leu Phe Pro Gly Leu Gin Gly Phe Asp Arg Leu Leu Tyr Ala 
115 120 ' 125 . 

Ser Gly Gly Ala Arg Val Asp Gly Gin Leu Leu Val Thr Arg Leu Leu 
130 135 140 

Glu Val Ser His Val Lys Leu Val' Lys Glu Lys Val Thr Leu. Thr Pro 
145 150 155 160 

Leu Ala Ser Gly Tyr Gin lie Gly Glu Glu Glu Phe Glu Gin Val lie 
165 ' 170 175 

Leu Ala Thr Gly Ala Trp Leu Gly Asp Met Leu Glu Pro Leu Gly Tyr 
180 185 190 

Glu Val Asp Val Arg Pro Gin Lys Gly Gin Leu Arg Asp Tyr Gin Leu 
195 200 205 



-Al-a-Gln Asp^ Met Glu Asp Tyr Pro Val Val Met Pro Glu Gly Glu Trp 

210 ■ - ~ - - 215 220 

Asp, Leu lie Pro Phe Ala Gly Gly Lys Leu Ser Leu Gly Ala Thr His 
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225 230 235 240 

Glu Asn Asp Met Gly Phe Asp Leu- Thr Val Asp Glu Thr Leu Leu Gin 
245 250 255 

Gin Met Glu Glu Ala Thr Leu Thr His Tyr Leu lie Leu Ala Glu Ala 
. 260 265 270 

Thr Ser Lys Ser Glu Arg Val Gly lie Arg Ala Tyr Thr Ser Asp Phe 
275 280 285 

Ser Pro Phe Phe Gly Gin Val Pro Asp Leu Thr Gly Val Tyr Ala Ala 
290 295 300 

Ser Gly Leu Gly Ser Ser Gly Leu Thr Thr Gly Pro lie lie Gly Tyr 
305 310 315 ' 320 

His Leu Ala Gin Leu lie Gin Asp Lys Glu Leu Thr Leu Asp Pro Leu 
325 330 335 

Asn Tyr Pro lie Glu Asn Tyr Val Lys Arg Val Lys Ser Glu 
340 345 350 ' 

(2) INFORMATION FOR SEQ ID NO : 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
TAAGGTCAAA AGTCAGACCG CTAAGAAAGT GCTAGAAAAG ATTGGAGCTG ACTCGGTTAT 
CTCGCCAGAG TATGAAATGG GGCAGTCTCT AGCACAGACC ATTC TTTTC C ATAATAGTGT 
TGATGTC TTT CAGTTGGATA AAAATGTGTC TATCGTGGAG ATGAAAATTC CTCAGTCTTG 
GGCAGGTCAA AGTCTGAGTA AATTAGACCT CCGTGGCAAA TACAATCTGA ATATTTTGGG 
TTTCCGAGAG CAGGAAAATT CCCCATTGGA TGTTGAATTT GGACCAGATG ACCTCTTGAA 
AGCAGATACC TATATTTTGG CAGTCATCAA CAACCAGTAT TTGGATACCC TA 
(2) INFORMATION FOR SEQ ID NQ:128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



Lys Val Lys Ser Gin Thr Ala Lys Lys Val Leu Glu Lys lie Gly. Ala 
1 5 10 15 
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Asp Ser Val He Ser Pro Glu Tyr Glu Met Gly Gin Ser Leu Ala Gin 
20 25 30 

Thr He Leu Phe His Asn Ser Val Asp Val Phe Gin Leu Asp Lys Asn 
35 - -40 45 

Val Ser He Val Glu Met Lys 'lie Pro Gin Ser Trp Ala Gly Gin Ser 
50 55 60 

Leu Ser Lys Leu Asp Leu Arg Gly Lys Tyr Asn Leu Asn He Leu Gly 
65 70 ,75 80 

Phe Arg Glu Gin Glu Asn Ser Pro Leu Asp Val Glu Phe Gly Pro Asp 
85 90 95 

Asp Leu Leu Lys Ala Asp Thr Tyr He Leu Ala Val He Asn Asn Gin 
100 105 110 

Tyr Leu Asp Thr Leu 

'" 115" 



(2) INFORMATION FOR SEQ ID NO: 12 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 base pairs 
.(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:' 

TGACGGGTCT CAGGATCAGA CTCAGGAAAT CGCTGAGTGT TTAGCTAGCA AGTATCCTAA 60 

TATCGTTAGA GCCATCTATC AGGAAAATAA ATGCCATGGC GGTGCGGTCA ATCGTGGCTT 120 

GGTAGAGGCT TCTGGGCGCT ATTTTAAAGT AGTTGACAGT GATGACTGGG TGGATCCTCG 180 
TGCCTACTTG AAAATTCTTG AAACTTGCAG GAACTTGAGA GCAAAGGTCA AGAGGTGGAT : 240 

GTCTTTG 247 
(2) INFORMATION FOR SEQ ID NO: 13 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino .acids 

(B) TYPE: ainino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Asp Gly Ser Gin Asp Gin Thr Gin Glu He Ala Glu Cys Leu Ala Ser 
1 5 10 15 

-Lys Tyr Pro Asn He Val— Arg— Ala—He-Tyr-Gln Glu Asn Lys Cys His 

— 20 25 - • " ' 30 

. Gly, Gly Ala Val Asn Arg Gly Leu Val Glu Ala Ser Gly Arg .Tyr Phe 
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35 40 45 

Lys Val Val Asp Ser Asp Asp Trp Val Asp Pro Arg Ala Tyr Leu Lys 
50 - 55 60 

lie Leu Glu Thr Cys Arg Asn Leu Arg Ala Lys Val Lys Arg Trp Met 
"65 .70 75 80 

Ser Leu 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: — 

(A) LENGTH: 1744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131':- 

TAGAGGCTTT GCCAAATGGT GGGAAGGGCA CGAGCGTCGA AAAGAGGAAC GCTTTGTCAA 60 

ACAAGAAGAA AAAGCTCGCC AAAAGGCTGA GAAAGAGGCT AGATTAGAAC AAGAAGAGAC 120 

TGAAAAAGCC TTACTCGATT TGCCTCCTGT TGATATGGAA AC GGGTGAAA TTCTGACAGA 180 

GGAAGCTGTT CAAAATCTTC CACCTATTCC AGAAGAAAAG TGGGTGGAAC CAGAAATCAT 240 

CCTGCCTCAA GCTGAACTTA AATTCCCTGA ACAGGAAGAT GACTCAGATG ACGAAGATGT 3 00 

TCAGGTCGAT TTTTCAGCCA AAGAAGC C CT TGAATACAAA CTTCCAAGCT TACAACTCTT 3 60 

TGCACCAGAT AAACCAAAAG ATCAGTCTAA AGAGAAGAAA ATTGTCAGAG AAAATATCAA 420 

AATCTTAGAA GCAACCTTTG CTAGCTTTGG TATTAAGGTA ACAGTTGAAC GGGCCGAAAT 480 

TGGGCCATCA GTGACCAAGT ATGAAGTCAA GCCGGCTGTT GGTGTAAGGG TCAACCGCAT 540 

TTCCAATCTA TCAGATGACC TCGCTCTAGC CTTGGCTGCC AAAGATGTCC GGATTGAAGC 600 

ACCAATCCCT GGGAAATCCC TAATCGGAAT TGAAGTGCCC AACTCCGATA TTGCCACTGT 660 

ATCTTTCCGA GAACTATGGG AACAATCGCA AACGAAAGCA GAAAATTTCT TGGAAATTCC 720 

TTTAGGGAAG GCTGTTAATG GAACCGCAAG AGCTTTTGAC CTTTCTAAAA TGCCCCACTT 780 

GCTAGTTGCA GGTTCAACGG GTTCAGGGAA GTCAGTAGCA GTTAACGGCA TTATTGCTAG 840 

CATTCTCATG AAGGC GAGAC CAGATCAAGT TAAATTTATG ATGGTCGATC CCAAGATGGT 900 

TGAGTTATCT GTTTACAATG ATATTCCCCA CCTCTTGATT CCAGTCGTGA CCAATCCACG 960 

CAAAGCCAGC AAGGCTCTGC AAAAGGTTGT GGATGAAATG GAAAACCGTT ATGAACTCTT - 102 0 

TGCCAAGGTG GGAGTTCGGA ATATTGCAGG TTTTAATGCC AAGGTAGAAG AGTTC AATTC 108 0 

CCAGTCTGAG TACAAGCAAA TTCCGCTACC ATTCATTGTC GTGATTGTGG— ATGAGTTGGC , 1140 

TGACCTCATG ATGGTGGCCA GCAAGGAAGT GGAAGATGCT ATCATCCGTC TTGGGCAGAA 1200 
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GGCGCGTGCT GCAGGTATCC ACATGATTCT TGCAACTCAG CGTCCATCTG TTGATGTCAT 12 60 

CTCTGGTTTG ATTAAGGCCA ATGTTCCATC TCGTGTAGCA TTTGC GGTTT CATCAGGAAC 132 0 

AGACTCCCGT ACGATTTTGG ATGAAAATGG AGCAGAAAAA CTTCTTGGTC GAGGAGACAT 1380 

GCTCTTTAAA CCGATTGATG • AAAATCATCC AGTTCGTCTC CAAGGCTCCT TTATCTCGGA 1440 

TGACGATGTT GAGCGCATTG TGAACTTCAT CAAGACTCAG GCAGATGCAG ACTACGATGA 1500 

GAGTTTTGAT CCAGGTGAGG TTTCTGAAAA TGAAGGAGAA TTTTCGGATG GAGATGCTGG 1560 

TGGTGATCCG CTTTTTGAAG AAGCTAAGTC TTTGGTTATC GAAACACAGA AAGCCAGTGC 162 0 

GTCTATGATT CAGCGTCGTT TATCAGTTGG ATTTAACCGT GCGACCCGTC TCATGGAAGA 168 0 

ACTGGAGATA GCAGGTGTCA TCGGTCCAGC TGAAGGTACC AAACCTCGAA AAGTGTTACA 1740 

^ ACAA 1744 

*D (2) INFORMATION FOR S EQ " ID NO : 1 3 2 T~ " 

rn (i) SEQUENCE CHARACTERISTICS : 

|ji (A) LENGTH: 581 amino acids 

jjij (B) TYPE: amino acid 

(C) STRANDEDNESS : single 
,7 (D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 2: 

Arg Gly Phe Ala Lys Trp Trp Glu Gly His Glu Arg Arg Lys Glu Glu 
.1 .5 10 15 . 

Arg Phe Val Lys Gin Glu Glu Lys Ala Arg Gin Lys Ala Glu Lys Glu 
20 25 30 

Ala Arg Leu Glu Gin Glu Glu Thr Glu Lys Ala Leu Leu Asp Leu Pro 
35 40 45 

Pro Val Asp Met Glu Thr Gly Glu lie Leu Thr Glu Glu Ala Val Gin 

50 55 60- .... - 

Asn Leu Pro Pro lie Pro Glu Glu Lys Trp Val Glu Pro Glu lie lie 
65 70 75 80. 

Leu Pro .Gin Ala Glu Leu Lys Phe Pro Glu Gin Glu Asp Asp Ser Asp 
85 90 95 

Asp Glu Asp Val Gin Val Asp Phe Ser Ala Lys Glu Ala Leu Glu Tyr 

100. / "~105 110 

Lys Leu Pro Ser Leu Gin Leu Phe Ala Pro Asp Lys Pro Lys Asp Gin 
115 120 - 125 

Ser Lys Glu Lys Lys lie Val Arg Glu Asn lie Lys "lie Leu Glu Ala 
130 135 " 140 



Thr Phe Ala Ser Phe- Gly lie Lys Val Thr -Val Glu Arg Ala Glu He 
145 " . "- ' 150 155 160 
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Gly Pro Ser Val Thr Lys Tyr Glu Val Lys Pro Ala Val Gly Val Arg 
■ ' 165 170 175 

Val Asn Arg lie Ser Asn Leu Ser Asp Asp Leu Ala Leu Ala Leu Ala 
180 185 190 

Ala Lys Asp Val Arg lie Glu .Ala Pro lie Pro Gly Lys. Ser Leu lie 
195 200 205 

Gly lie Glu Val Pro Asn Ser Asp lie Ala Thr Val- Ser Phe Arg Glu 
210 215 . 220 

Leu Trp Glu Gin Ser Gin Thr Lys Ala Glu Asn Phe Leu Glu lie Pro 
225 230 235 . 240 

Leu Gly Lys Ala Val Asn Gly Thr Ala Arg. Ala. Phe Asp Leu Ser Lys 
245 250 255. 

Met Pro His Leu Leu Val Ala Gly -Ser Thr Gly Ser Gly . Lys Ser Val 

260 ......... , 265 . , 270 

Ala Val Asn Gly lie lie Ala Ser lie Leu Met Lys Ala Arg Pro Asp 

275 280 . . . 285 

Gin Val Lys Phe Met Met Val Asp Pro Lys Met Val Glu Leu Ser Val 
290 295 300 

Tyr Asn Asp lie Pro His Leu Leu lie Pro Val Val Thr Asn Pro Arg 
305 310 315 320 

Lys Ala Ser Lys Ala Leu Gin Lys Val Val Asp Glu Met Glu Asn Arg 
325 330 335 

Tyr Glu Leu Phe Ala Lys Val Gly Val Arg Asn lie Ala Gly Phe Asn 
340 345 350 

Ala Lys Val Glu Glu Phe Asn Ser Gin Ser Glu Tyr Lys Gin lie Pro 
355 360 365 

Leu Pro Phe lie Val Val lie Val Asp Glu Leu Ala Asp Leu Met Met 
370 375 , 380 

Val Ala Ser Lys Glu Val Glu Asp Ala lie lie Arg Leu Gly Gin Lys 
385 390 395 400 

Ala Arg Ala Ala Gly lie His Met lie Leu Ala Thr Gin Arg Pro Ser 
405 410 415 

Val Asp Val lie Ser Gly Leu lie Lys Ala Asn Val Pro Ser Arg Val 
420 425 430 

Ala Phe Ala Val Ser Ser Gly Thr Asp Ser Arg Thr lie Leu Asp Glu 
435 440 445 

Asn Gly Ala Glu Lys Leu Leu Gly Arg Gly Asp Met Leu Phe Lys Pro 
450 455" 460- 

lie Asp Glu Asn His Pro Val Arg Leu Gin Gly Ser Phe_Ile Ser Asp 
465 470. - 475 - " . 480 

Asp Asp Val Glu Arg lie Val Asn Phe "lie ~Lys Thr Gin Ala Asp Ala 
* ~485- 490 . 495 



# 
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Asp Tyr Asp Glu Ser Phe Asp Pro Gly Glu Val Ser Glu Asn Glu Gly 
500 - 505 510 . 

Glu Phe Ser Asp Gly Asp Ala Gly Gly Asp Pro Leu Phe Glu Glu Ala 
515 520 525 

Lys Ser Leu Val lie Glu Thr Gin. Lys Ala Ser Ala.. Ser Met lie Gin 
530 535 540 

Arg Arg Leu Ser Val Gly Phe, Asn Arg Ala Thr Arg Leu Met Glu Glu 
545 ' 550 555 / 560 

Leu Glu lie Ala Gly Val lie Gly Pro Ala Glu Gly Thr Lys Pro Arg 
565- 570 575 

Lys Val Leu Gin Gin 
580 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS : - .' 

(A) LENGTH: 829 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:- 

TCAAAAAGAG AAGGAAAACT TGGTTATTGC TGGGAAAATA GGTC C AGAAC CAGAAATTTT 60 

GGCCAATATG TATAAGTTGC TGATTGAAGA AAATACCAGC ATGACTGCGA CTGTTAAACC 120 

r 

GAATTTTGGG AAGACAAGCT TCCTTTATGA AGC TCTGAAA AAAGGCGATA TTGACATCTA 180 

TCCTGAATTT ACTGGTACGG TGACTGAAAG TTTGCTTCAA CCATCACCCA AGGTGAGTCA 240 

TGAACCAGAA CAGGTTTATC AGGTGGCGCG TGATGGCATT GCTAAGCAGG ATCATCTAGC 3 00 

CTATCTCAAA CCCATGTCTT ATCAAAACAC CTATGCTGTA GCTGTTCCGA AAAAGATTGC 3 60 

TCAAGAATAT GGC TTGAAGA CCATTTCAGA CTTGAAAAAA GTGGAAGGGC AGTTGAAGGC 420 

AGGTTTTACA CTCGAGTTTA ACGACCGTGA AGATGGAAAT AAGGGCTTGC AATCAATGTA 480 

TGGTCTCAAT CTCAATGTAG CGACCATTGA GCCAGCCCTT CGCTATCAGG CTATTCAGTC . 540 

AGGGGATATT CAAATCACGG ATGCCTATTC .GACTGATGCG GAATTGGAGC GTTATGATTT 600 

AC AGGTC TTG GAAGATGACA AGCAACTCTT CCCACCTTAT CAAGGGGCTC CACTCATGAA 660 

AGAAGCTCTT CTCAAGAAAC ACCCAGAGTT GGAAAGAGTT CTTAATACAT TGGCTGGTAA "'720 

GATTACAGAA AGC C AGATGA GCCAGCTCAA CTACCAAGTC GGTGTTGAAG GCAAGTCAGC 78 0 

AAAGCAAGTA GCCAAGGAGT TTCTCCAAGA ACAAGGTTTG TTGAAGAAA 829 

(2) INFORMATION FOR SEQ ID NO:134: - _ 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:134: 

Gin Lys Glu Lys Glu Asn Leu Val lie Ala Gly Lys lie Gly Pro Glu 
1 5 10 15 

Pro Glu lie Leu Ala Asn Met Tyr Lys Leu Leu lie Glu Glu Asn Thr 
20 25 -30 

Ser Met Thr Ala Thr Val Lys Pro Asn Phe Gly Lys Thr Ser Phe Leu 
35 40 45 

Tyr Glu Ala Leu Lys Lys Gly Asp lie Asp lie Tyr Pro Glu Phe Thr 
50 55 60 

Gly Thr Val Thr Glu Ser Leu Leu Gin Pro Ser Pro Lys Val Ser His 
65 70 75 80 

Glu Pro Glu Gin Val Tyr Gin Val Ala Arg Asp Gly lie Ala Lys Gin 
85 } 90 95 

Asp His Leu Ala Tyr Leu Lys Pro Met Ser Tyr Gin Asn Thr Tyr Ala 
100 105 110 

Val Ala Val Pro Lys Lys lie Ala Gin Glu Tyr Gly Leu Lys Thr lie 
115 120 125 . 

Ser Asp Leu Lys Lys Val Glu Gly Gin Leu Lys Ala Gly Phe Thr Leu 
130 135 ' 140 

Glu Phe Asn Asp Arg Glu Asp Gly Asn Lys Gly Leu Gin Ser Met Tyr 
145 150 155 160 

Gly Leu Asn Leu Asn Val Ala Thr lie Glu Pro Ala Leu Arg Tyr Gin 
165 170- 175 

Ala lie Gin Ser Gly Asp lie Gin lie Thr Asp Ala Tyr Ser Thr Asp 
180 185 190 

Ala Glu Leu Glu Arg Tyr Asp Leu Gin Val Leu Glu Asp Asp Lys Gin 
195 200 205 

Leu Phe Pro Pro Tyr Gin Gly Ala Pro Leu Met Lys Glu Ala Leu Leu 
210 215 220 

Lys Lys His Pro Glu Leu Glu Arg Val Leu Asn Thr Leu Ala Gly Lys 
225 230 235 240 

lie Thr Glu Ser Gin Met Ser Gin Leu Asn Tyr Gin Val Gly Val, Glu 
245 250 ' 255 

Gly Lys Ser Ala Lys Gin Val Ala Lys Glu Phe Leu Gin Glu Gin Gly 
260 265 270 

Leu Leu Lys Lys 

275 



(2) INFORMATION FOR SEQ ID NO : 13 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 712 base pairs 

(B) TYPE: nucleic acid 
■(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

ACGTTCTATT GAGGACCACT TTGATTCAAA CTTCGAATTG GAATATAACC TCAAAGAAAA 60 

AGGGAAAACA GATCTTTTGA AGCTAGTTGA TAAAACAACT. GACATGCGTC TGCATTTTAT 12 0 • 

CCGCCAAACT CATCCACGCG GTCTCGGAGA TGCTGTTTTG CAAGCCAAGG CTTTCGTCGG 18 0 

AAATGAACCT TTTGTC GTT A TGC TTGGTGA TGACTTGATG GATATCACAG AC GAAAAGGC 2 40" . 

G TGTTCCACTT ACCAAACAAC TCATGGATGA CTACGAGCGT ACCCACGCGT CTACTATCGC 3 00 

%j TGTCATGCCA GTCCCTCATG ACGAAGTATC TGCTTACGGG GTTATTGCTC CGCAAGGCGA 3 60 

jjjl 

■fn AGGAAAAGAT GGTCTTTACA GTGTTGAAAC CTTTGTTGAA AAACCAGCTC CAGAGGACGC 42 0 

TCCTAGCGAC CTTGCTATTA TCGGACGCTA CCTCCTCACG CCTGAAATTT TTGAGATTCT 48 0 

CGAAAAGCAA GCTCCAGGTG CAGGAAATGA AATTCAGCTG ACAGATGCAA TCGACACCCT 540 

Q CAATAAAACA CAACGTGTAT TTGCTCGTGA GTTCAAAGGG GCTCGTTACG ATGTCGGAGA 600 

jjlj CAAGTTTGGC TTCATGAAAA CATCCATCGA CTACGCCCTC AAACACCCAC AAGTCAAAGA 66 0 

TGATTTGAAG AATTAC CTCA TCCAACTTGG AAAAGAATTG ACTGAGAAGG AA " 712 ~ 

H (2) INFORMATION FOR SEQ ID NO: 13 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi)- SEQUENCE DESCRIPTION: SEQ ID NO:136: 

Arg Ser lie Glu Asp His Phe Asp Ser Asn Phe Glu Leu Glu-Tyr Asn 
1.5 10 15 

Leu Lys Glu Lys Gly Lys Thr Asp Leu Leu Lys Leu Val Asp Lys Thr 
20 25 30 

Thr Asp Met Arg Leu His Phe lie Arg Gin Thr His Pro Arg Gly Leu 
35 40 45 

Gly Asp Ala Val Leu Gin Ala Lys Ala Phe Val Gly Asn Glu Pro Phe 
50 55 60 



Val Val Met Leu Gly Asp Asp Leu Met Asp lie Thr Asp Glu Lys Ala 
65 70 75 80 
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Val Pro Leu Thr Lys Gin Leu Met Asp Asp Tyr Glu Arg Thr His Ala 
85 90 • 95 

Ser Thr lie Ala Val Met Pro Val Pro His Asp Glu Val Ser Ala Tyr 
100 105 110 

• Gly Val lie Ala Pro Gin Gly Glu Gly Lys Asp Gly Leu Tyr Ser Val 
115 120 . 125 

Glu Thr Phe Val Glu Lys Pro Ala Pro Glu Asp Ala Pro Ser Asp Leu 
130 135 140 

Ala lie lie Gly Arg Tyr Leu Leu Thr .Pro Glu He Phe Glu lie Leu 
— 145 . . 150 155 160 

Glu Lys Gin Ala Pro Gly Ala Gly Asn Glu He Gin Leu Thr Asp Ala 
165 170 175 

He Asp Thr Leu Asn Lys Thr Gin Arg Val Phe Ala Arg Glu Phe Lys 
180 185 190 

Gly Ala Arg Tyr Asp Val Gly Asp Lys Phe Gly Phe Met Lys Thr Ser 
195 200 205 

He Asp Tyr Ala Leu Lys His Pro Gin Val Lys Asp Asp Leu Lys Asn 
210 215 220 

Tyr Leu He Gin Leu Gly Lys Glu Leu Thr Glu Lys Glu 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

CGCTCAAAAT AC C AG AGGTG TTCAGCTAAT CGAGCACGTT TCTCCTCAAA TGTTGAAAGC 60 

CCAATTGGAG AGTGTCTTTT CTGATATTCC AC CTC AGGCT GTAAAAACTG GAATGTTGGC 120 

TACTACTGAA .ATCATGGAAA TCATCCAACC CTATCTTAAA AAACTGGATT GTCCCTATGT 180 

CCTTGATCCT GTTATGGTTG CTACAAGTGG AGATGCCTTG ATTGACTCAA ATGCTAGAGA 240 

CTATCTCAAA ACAAACTTAC TACCTCTAGC AACTATTATT ACGCCAAATC TTCCTGAAGC 3 00 

AGAAGAGATT GTTGGTTTTT CAATCCATGA CCCCGAAGAC ATGCAGCGTG CTGGTCGCCT 3 60 

GATTTTAAAA GAATTTGGTC CTCAGTCTGT GGTTATCAAA GGCGGACATC TCAAAGGTGG 420 

TGC TAAAGAT TTCCTCTTTA CCAAGAATGA ACAATTTGTC TGGGAAAGCC C AC GAATTC A 480 

AAC CTGTC AC AC CCATGGTA CT 502 



(2) INFORMATION FOR SEQ ID NO:138: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:138: 

. Ala Gin Asn Thr Arg Gly Val Gin Leu lie Glu His Val Ser Pro Gin 
1 5 10 15 

Met Leu Lys Ala Gin Leu Glu Ser Val Phe Ser Asp lie Pro Pro Gin 
20 25 30 

- Ala Val Lys Thr Gly Met Leu Ala Thr Thr Glu lie Met Glu lie lie 
35 40 - 45 

Gin Pro Tyr Leu Lys Lys Leu Asp Cys Pro Tyr Val Leu Asp Pro Val 
50 55 60 

Met Val Ala Thr Ser Gly Asp Ala Leu lie Asp Ser Asn Ala Arg Asp 
65 70 75 80 

Tyr Leu Lys Thr Asn Leu Leu Pro Leu Ala Thr lie lie Thr Pro Asn 
85 ,90 95 

Leu Pro Glu Ala Glu Glu lie Val Gly Phe Ser lie His Asp Pro Glu 
100 105 110 

Asp Met Gin' Arg Ala Gly Arg Leu lie Leu Lys Glu Phe Gly Pro Gin 
115 120 . ^ 125 

Ser Val Val lie Lys Gly Gly His Leu Lys Gly Gly Ala Lys Asp Phe 
130 135 .140 

Leu Phe Thr Lys Asn Glu Gin Phe Val Trp Glu Ser Pro Arg lie Gin 
145 150 155 160 

Thr Cys His Thr His Gly Thr 
165 

(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

AATTGTACAA TTAGAAAAAG ATAGC AAATC " AGACAAAGAA CAAGTTGATA AACTATTTGA 60 

ATCATTTGAT GCATCTTCAG ATGAATCTAT TTCTAAATTA AAAGAACTAT CTGAAACTTC 120 

ACTTAAAACC GATGC AGGTA~ AAGACTATCT - TAATAACAAA GTCAAAGAAT CATCTAAAGC 180 

AATTGTAGAT TTTCATTTGC AAAAAGGTTT GGCTTATGAT GTTAAAGATT CAGATGACAA .240 
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ATTTAAAGAT AAAGCAACTC TTGAAACAAA TGTAAAAGAA ATTACAAAAC AAATTGATTT 300 

TATCAAAAAA G TTG ATGAAA CTTTTAAACA AGAGAATTTG GAAGAAACTC TTAAATCTCT 360 

-AAATGATCTT GTTGATAAAT ATCAAAAACA AATCGAACTT TTGAAGAAAG AAGAAGAAAA 42 0 

AGCTGCTGAA AAAGCTGCTG AAAAAGCAAA GGAATCTTCT AGTCAAAGTA ATTCTTCTGG 48 0 

TAGTGCTTCT ' AATGAGTCTT ATAATGGATC TTCCAATTCA AATGTAGJiTT ATAGTTCATC 540 

TGAACAAACT AATGGATATT CAAATAATTA TGGCGGTCAA GATTATTCTG GTTCAGGAGA ' 600 

TAGTTCAACA AATGGTGGAT CATCAGAACA ATATTCATCT AGCAATTCAA ACAGCGGAGC 660 

AAATAATGTC TACAGATATA AAGGCACTGG TGCTGACGGC TATCAAAGAT ACTACTACAA 72 0 

AGATCATAAT AATGGAGATG TGTATGATGA CGATGGAAAT TACCTTGGGA AC TTTGGTGG 78 0 

CGGCATTGCA GAACCTAGTC AACGC 8 05 



^IJ (2) INFORMATION FOR SEQ ID NO: 140: 

''Si 

m (i) SEQUENCE CHARACTERISTICS: 

|j? "(A) LENGTH: 268 amino acids 

jjia (B) TYPE: amino acid 

3/1 . (C) STRAND EDNESS : single 

.J; (D) TOPOLOGY: linear 

s j_ (ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

lie Val Gin Leu Glu Lys Asp Ser Lys Ser Asp Lys Glu Gin Val Asp 
1 5 10 15 

Lys Leu Phe Glu Ser Phe Asp Ala Ser Ser Asp Glu Ser lie Ser Lys 
20 25 30 

Leu Lys Glu Leu Ser Glu Thr Ser Leu Lys Thr Asp Ala Gly Lys Asp 

35 40 45 

Tyr Leu Asn Asn Lys Val Lys Glu Ser Ser Lys Ala lie Val Asp Phe 
50 55 60 

His Leu Gin Lys Gly Leu Ala Tyr Asp Val Lys Asp Ser Asp Asp Lys ' 
.65 70 75 80 

Phe Lys Asp Lys Ala Thr Leu Glu Thr Asn Val Lys Glu lie Thr Lys . 

85 90 95 

Gin lie Asp Phe lie Lys Lys Val Asp. Glu Thr Phe Lys Gin Glu Asn 

100 " " 105 110 

Leu Glu Glu Thr Leu Lys Ser Leu Asn Asp Leu Val Asp Lys Tyr Gin 

115 120 " 125 

Lys„.Gln lie Glu Leu Leu Lys Lys Glu Glu Glu Lys Ala Ala Glu Lys 
130 135 140 



Ala Ala Glu Lys Ala Lys Glu Ser Ser Ser Gin Ser Asn Ser Ser Gly 
145 - 150 . • * 155 160 
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Ser Ala Ser Asn Glu Ser Tyr Asn Gly Ser Ser Asn Ser Asn Val Asp 
165 170 175 

Tyr Ser Ser Ser Glu Gin Thr Asn Gly Tyr Ser Asn Asn Tyr Gly Gly 
'—180 185 190 

Gin Asp Tyr Ser Gly Ser Gly Asp Ser Ser Thr Asn Gly Gly Ser Ser 

195 200 205 

Glu Gin Tyr Ser Ser" Ser Asn Ser Asn Ser Gly Ala Asn Asn Val Tyr 
210 215 220" 

Arg Tyr Lys Gly Thr Gly Ala Asp Gly Tyr Gin Arg Tyr Tyr Tyr Lys 
225 230 235 — 240 

Asp His Asn Asn Gly Asp Val Tyr Asp Asp Asp Gly Asn Tyr -Leu Gly 
245 250 255, 

Asn Phe Gly Gly Gly lie Ala Glu Pro Ser Gin Arg 

' 2 60 . 265 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:. 
TCTGACCAAG CAAAAAGAAG CAGTCAATGA CAAAGGAAAA GCAGCTGTTG TTAAGGTGGT 
GGAAAGCCAG GCAGAACTTT ATAGCTTAGA AAAGAATGAA GATGCTAGCC TAAGAAAGTT 
ACAAGCAGAT GGACGCATCA CGGAAGAACA GGCTAAAGCT TATAAAGAAT ACAATGATAA 
AAATGGAGGA GCAAATCGTA AAGTCAATGA T 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid. 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Leu Thr Lys Gin Lys Glu Ala Val Asn Asp Lys Gly Lys Ala Ala Val 
1 5 10 15 

Val Lys Val Val Glu Ser Gin Ala Glu Leu Tyr Ser Leu Glu Lys Asn 
20 - 25 . 30 



Glu "Asp— Al'a Ser Leu Arg Lys Leu Gin Ala Asp .Gly Arg lie Thr Glu 

— 35 40 . — ■ 45 
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Glu Gin Ala Lys Ala .Tyr Lys Glu Tyr Asn Asp Lys Asn Gly Gly Ala 

"" 50 / 55 60 



Asn Arg Lys Val Asn Asp 
65 70 



(2). INFORMATION FOR SEQ ID NO: 143: 

( i) . SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double . 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

GTCCGGCTCT GTCCAGTCCA CTTTTTCAGC GGTAGAGGAA CAGATTTTCT TTATGGAGTT 60 

* 

TGAAGAACTC T ATC GGG AAA CCCAAAAACG CAGTGTAGCC AGTCAGCAAA AGACTAGTCT 12 0 

G AAC TT AG AT GGGCAGACGC TTAGCAATGG CAGTCAAAAG TTGCCAGTCC CTAAAGGAAT 18 0 

TCAGGCCCCA TCAGGCCAAA GTATTACATT TGACCGAGCT GGGGGCAATT CGTCCCTGGC 240 

TAAGGTTGAA TTTCAGACCA GTAAAGGAGC GATTCGCTAT CAATTATATC TAGGAAATGG 3 00 

AAAAATTAAA CGCATTAAGG AAACAAAAAA T 331 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) .SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:144: 

Ser Gly Ser Val Gin Ser Thr Phe Ser Ala Val Glu Glu Gin lie Phe 
1 5 10 15 

Phe Met Glu Phe Glu Glu Leu Tyr Arg Glu Thr Gin Lys Arg Ser Val 



20 



25 



30 



Ala Ser Gin Gin Lys Thr 

. 35 



Ser 



Leu Asn Leu Asp Gly Gin Thr Leu Ser 
40 45 



Asn Gly Ser Gin Lys Leu 
50 



Pro 
55 



Val Pro Lys Gly lie Gin Ala Pro Ser 
60 



Gly Gin Ser lie Thr Phe 
65 70 



Asp 



Arg Ala Gly Gly Asn Ser Ser Leu Ala 
75 80 



Lys Val Glu Phe Gin Thr 
85 



Ser 



Lys Gly. Ala lie Arg Tyr Gin Leu Tyr 

_. 90 95 •' - 



Leu Gly Asn Gly Lys lie Lys Arg lie Lys Glu Thr Lys Asn 

100; 105 .... 110 



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 6 base pairs 

(B) TYPE: - nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
GGGACAAATT CAAAAAAATA GGCAAGAGGA AGCAAAAATC TTGC AAAAGG AAGAAGTCTT 
GAGGGTAGCT AAGATGGCCC TGCAGACGGG GCAAAATCAG GTAAGCATCA ACGGAGTTGA 
GATTCAGGTA TTTTC TAGTG AAAAAGGATT GGAGGTCTAC CATGGTTCAG AACAGTTGTT 

GGCAATCAAA GAGCCA 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:146: 

Gly Gin lie Gin Lys Asn Arg Gin Glu Glu Ala 'Lys lie Leu Gin Lys 

1 . 5 - 10 15 . 

Glu Glu Val Leu Arg Val. Ala Lys Met Ala Leu Gin Thr Gly Gin Asn 
20 25 ' 30 

Gin Val Ser lie Asn Gly Val Glu lie Gin Val Phe Ser Ser Glu Lys 
35 40 45 

Gly Leu Glu Val Tyr His Gly Ser Glu Gin Leu Leu Ala lie Lys Glu 
50 55 60 

Pro 
65 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

TCGCTACCAG CAACAAAGCG AGCAAAAGGA GTGGCTCTTG TTTGTGGACC AACTTGAGGT 
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AGAATTAGAC CGTTCGCAGT TCGAAAAAGT AGAAGGCAAT CGCCTATACA" TGAAGCAAGA 120 

TGGCAAGGAC ATCGCCATCG GTAAGTCAAA GTCAGATGAT TTCCGTAAAA CGAATGCTCG 180 
TGGTCGAGGT TATCAGCCTA TGGTTTATGG ACTGAAATCT GT AC GG ATT A CAGAGGACAA — 240 

TCAACTGGTT CGCTTTCATT TCCAGTTCCA AAAAGGCTTA GAAAGGGAGT TCATCTATCG .3 00 

TGTGGAAAAA GAAAAAAGT 319 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
-(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Arg Tyr Gin Gin Gin Ser Glu Gin Lys Glu Trp Leu Leu Phe Val Asp 
15 10 15 

Gin Leu Glu Val Glu Leu Asp Arg Ser Gin Phe Glu Lys Val Glu Gly 
20 25 30 

Asn Arg Leu Tyr Met Lys Gin Asp Gly Lys Asp lie Ala lie Gly Lys 
35 40 45 

Ser Lys Ser Asp Asp Phe Arg Lys Thr Asn Ala Arg Gly Arg Gly Tyr 
50 55 " 60 

Gin Pro Met Val Tyr Gly Leu Lys Ser Val Arg lie Thr Glu Asp Asn 
65 70 75 . 80 

Gin Leu Val Arg Phe His Phe Gin Phe Gin Lys Gly Leu Glu Arg Glu 
85 90 95 

Phe lie Tyr Arg Val Glu Lys Glu Lys Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

GAACCGACAA GTCGCCCACT ATCAAGACTA TGCTTTGAAT AAAGAAAAAT TGGTTGC TTT. 60 

TGCTATGGCT AAACGAACCA AAGATAAGGT TGAGCAAGAA AGTGGGGAAC AGTTTTTTAA 12 0 

TCTAGGTC AG GTAAGCTATC AAAACAAGAA AACTGGCTTA GTGACGAGGG TTC GTACGGA 18 0 



TAAGAGCCAA TATGAGTTTC TGTTTCCTTC AGTCAAAATC AAAGAAGAGA .AAAGAGATAA 240 
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AAAGGAAGAG GTAGCGACCG ATTCAAGCGA AAAAGTGGAG AAGAAAAAAT CAGAAGAGAA 
GCCTGAAAAG AAAGAGAATT CA - . 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: iinear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ- ID NO:150: 

Asn Arg Gin Val Ala His Tyr Gin Asp Tyr Ala Leu Asn Lys Glu Lys 
1 5 10 15 

Leu Val Ala Phe Ala Met Ala Lys Arg Thr Lys Asp Lys Val Glu Gin 
20 25 30 

Glu Ser Gly Glu Gin Phe Phe Asn Leu Gly Gin Val Ser Tyr Glh Asn 
35 40 45 

Lys Lys Thr Gly Leu Val Thr Arg Val Arg Thr Asp Lys Ser Gin Tyr 
. 50 55 60 

Glu Phe Leu Phe Pro Ser Val Lys lie Lys Glu Glu Lys Arg Asp Lys 
65 70 -75 80 

Lys Glu Glu Val Ala Thr Asp Ser Ser Glu Lys Val Glu Lys Lys Lys 
85 90 95 

Ser Glu Glu Lys Pro Glu Lys Lys Glu Asn Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 151:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
GGTTGTC GGC TGGCAATATA TCCCGTTTCC ATCTAAAGGT AGTACAATTG GTCCTTACCC 
AAATGGTATC AGATTAGAAG GTTTTCCAAA GTCAGAGTGG TACTACTTCG ATAAAAATGG 
AGTGCTACAA GAGTTTGTTG GTTGGAAAAC ATTAGAGATT AAAACTAAAG ACAGTGTTGG 
AAGAAAGTAC GGGGAAAAAC GTGAAGATTC AGAAGATAAA GAAGAGAAGC GTTATTATAC 
GAACTATTAC TTTAATCAAA ATCATTCTTT AGAGACAGGT TGGCTTTATG ATCAGTCTAA 
CTGGTATTAT CTAGCTAAGA CGGAAATTAA TGGAGAAAAC TACCTTGGTG GTGAAAGACG 



259 

TGCGGGGTGG ATAAACGATG ATTCGACTTG GTACTACCTA GATCCAACAA CTGGTATTAT 42 0 

GCAAACAGGT TGGCAATATC TAGGTAATAA GTGGT AC T AC CTCCGTTCCT CAGGAGCAAT 480 

GGCCACTGGC TGGTATCAGG AAGGT AC C AC TTGGTATTAT TTAGACCACC CAAATGGCGA 540 

TATGAAAACA GGTTGGCAAA ACCTTGGGAA CAAATGGTAC TATCTCCGTT CATCAGGAGC 600 

TATGGCAACT GGTTGGTATC AAGATGGTTC AACTTGGTAC TACC TAAATG CAGGTAATGG 6 60 

AGACATGAAG ACAGGTTGGT TCCAGGTCAA TGGCAACTGG TACTATGCTT ATAGCTCAGG 72 0 

TGCTTTGGCA GTGAATACGA CCGTAGATGG CTATTCTGTC AACTATAATG GCGAATGGGT 78 0 

TCGG 784 
(2) INFORMATION FOR SEQ ID NO: 152: . . 

. (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein - 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Val Val Gly Trp Gin Tyr lie Pro Phe Pro Ser Lys Gly Ser Thr lie 
1 5 10 15 

Gly Pro Tyr Pro Asn Gly lie Arg Leu Glu Gly Phe Pro Lys Ser Glu 
20 25 30 

Trp Tyr Tyr Phe Asp Lys Asn Gly Val Leu Gin Glu Phe Val Gly Trp 
35 40 45 

Lys Thr Leu Glu lie Lys Thr Lys Asp Ser Val Gly Arg Lys Tyr Gly 
50 55 60 

"Glu Lys Arg- Glu Asp Ser Glu Asp Lys Glu Glu Lys Arg Tyr Tyr Thr 
65 70 75 80 

Asn Tyr Tyr Phe Asn Gin Asn His Ser Leu Glu Thr Gly Trp Leu Tyr 
85 90 95 

■ 1 

■ Asp Gin Ser Asn Trp Tyr Tyr Leu Ala Lys Thr Glu lie Asn Gly Glu 
100 105' 110 

Asn Tyr Leu Gly Gly Glu Arg Arg Ala Gly Trp lie Asn Asp Asp Ser 
115 120 125 

- Thr Trp Tyr Tyr Leu Asp Pro Thr Thr Gly lie Met Gin Thr Gly Trp 
130 135 140 

Gin Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met 
145 150 155 . 160 

— Ala Thr Gly Trp Tyr Gin Glu Gly Thr Thr Trp Tyr Tyr Leu Asp His 

. 165 ■ 170 175 

Pro, Asn Gly Asp Met Lys Thr Gly Trp Gin Asn Leu Gly Asn Lys Trp 



■ : - * 
260 

180 185 190 

Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gin Asp 
" 195 200 205 

Gly Ser Thr Trp Tyr Tyr Leu Asn Ala Gly Asn Gly Asp Met Lys Thr 
210 215 220 

Gly Trp Phe Gin Val Asn Gly Asn Trp Tyr Tyr Ala Tyr Ser Ser Gly 
225 230 235 240 

Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Ser Val Asn Tyr Asn 
245 250 255 

Gly Glu Trp Val Arg 

260 ^ 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 1708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

GGCCAAATCA GAATGGGTAG AAGACAAGGG AGCCTTTTAT TATC TTGACC AAGATGGAAA 6 0 

GATGAAAAGA AATGCTTGGG TAGGAAC TTC CTATGTTGGT GCAACAGGTG CCAAAGTAAT 12 0 

AGAAGACTGG GTCTATGATT CTCAATACGA TGCTTGGTTT TATATCAAAG CAGATGGACA 18 0 

GCACGCAGAG AAAGAATGGC TCCAAATTAA AGGGAAGGAC TATTATTTCA AATCCGGTGG 240 

TTATCTACTG ACAAGTCAGT GGATTAATCA AGCTTATGTG AATGCTAGTG GTGCCAAAGT " 3 00 

ACAGCAAGGT TGGCTTTTTG ACAAACAATA CCAATCTTGG TTTTACATCA AAGAAAATGG 3 60 

AAACTATGCT GATAAAGAAT GGATTTTCGA GAATGGTCAC TATTATTATC TAAAATCCGG 42 0 

TGGCTACATG GCAGCCAATG AATGGATTTG GGATAAGGAA TCTTGGTTTT ATCTCAAATT 48 0 

TGATGGGAAA ATGGCTGAAA AAGAATGGGT CTACGATTCT CATAGTCAAG CTTGGTACTA 540 

CTTCAAATCC GGTGGTTACA TGACAGCCAA TGAATGGATT . TGGGATAAGG AATCTTGGTT 600 

TTATCTCAAA TCTGATGGGA AAATAGCTGA AAAAGAATGG GTCTACGATT CTCATAGTCA 660 

AGC TTGGTAC TACTTCAAAT CCGGTGGTTA CATGACAGCC AATGAATGGA TTTGGGATAA 720 

GGAATCTTGG TTTTACCTCA AATC TGATGG GAAAATAGCT GAAAAAGAAT GGGTCTACGA . 7 80 

TTCTCATAGT CAAGCTTGGT ACTACTTCAA ATC TGGTGGC TACATGGCGA AAAATGAGAC 840 

AGTAGATGGT TATCAGCTTG GAAGCGATGG TAAATGGCTT GGAGGAAAAA C TAC AAATGA 900 

AAATGCTGCT-TAGTATCAAG TAGTGCCTGT TACAGCCAAT GTTTATGATT CAGATGGTGA 960- 

AAAGCTTTCC TATATATCGC AAGGTAGTGT CGTATGGCTA GATAAGGATA GAAAAAGTGA ' 102 0 
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TGACAAGCGC TTGGC TATT A CTATTTCTGG TTTGTCAGGC TATATGAAAA CAGAAGATTT 1080 

ACAAGCGCTA GATGC TAGTA AGGACTTTAT C C C TT ATT AT GAGAGTGATG GCCACCGTTT 1140 

TTATCACTAT GTGGC TC AG A ATGCTAGTAT CCCAGTAGCT TCTCATCTTT CTGATATGGA 1200 

AGTAGGCAAG AAATATTATT CGGCAGATGG CCTGCATTTT GATGGTTTTA AGCTTGAGAA 1260 

TCCCTTCCTT TTCAAAGATT TAACAGAGGC TACAAACTAC AGTGCTGAAG AATTGGATAA 1320 

GGTATTTAGT TTGCTAAACA TTAACAATAG CCTTTTGGAG AACAAGGGCG CTACTTTTAA ' 1380 

GGAAGCCGAA G AAC ATT AC C ATATCAATGC TCTTTATCTC CTTGCCCATA GTGCCCTAGA 1440 

AAGTAACTGG GGAAGAAGTA AAATTGCCAA AGATAAGAAT AATTTCTTTG GCATTACAGC ' 1500 

CTATGATACG ACCCCTTACC TTTCTGCTAA GACATTTGAT GATGTGGATA AGGGAATTTT 1560 

AGGTGCAACC AAGTGGATTA AGGAAAATTA TATCGATAGG GGAAGAACTT TCCTTGGAAA 1620 

~CAAGGCTTCT GGTATGAATG TGGAATATGC TTCAGACCCT TATTGGGGCG AAAAAATTGC 1680 

TAGTGTGATG ATGAAAATCA ATGAGAAG 17 08 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 5 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Ala Lys Ser Glu Trp Val Glu Asp Lys Gly Ala Phe Tyr Tyr Leu Asp 

1 . -5 10 15 

Gin Asp Gly Lys Met Lys Arg Asn Ala Trp Val Gly Thr Ser Tyr Val 
20 25 30 . 

Gly Ala Thr Gly Ala Lys Val lie Glu' Asp Trp Val Tyr Asp Ser Gin 
35 40 45 

Tyr Asp Ala Trp Phe Tyr lie Lys Ala Asp Gly Gin His Ala Glu Lys 
50 55 60 

Glu Trp Leu Gin lie Lys Gly Lys Asp Tyr Tyr Phe Lys Ser Gly Gly 

65 70 ; . 75 ^ 80 

Tyr Leu Leu Thr Ser Gin Trp lie Asn Gin Ala Tyr Val Asn Ala Ser . 

85 90 95 

- Gly Ala , Lys Val Gin Gin Gly Trp Leu Phe Asp' Lys Gin Tyr Gin Ser 
100 105 _ - 110 

Trp Phe Tyr lie Lys Glu Asn Gly Asn Tyr Ala Asp Lys Glu Trp lie. . 

115 — 1-20 .125 ' ' * ■ 

Phe Glu Asn Gly His Tyr Tyr Tyr Leu Lys Ser Gly Gly Tyr Met Ala 
130 135 140 
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"Ala Asn Glu Trp lie Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Phe 
145 150. 155 160 

-Asp Gly Lys Met Ala Glu Lys * Glu Trp Val Tyr Asp Ser His Ser Gin 
165 170 175 

Ala Trp Tyr Tyr Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp 
180 185 190 

lie Trp Asp Lys Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys lie 
195 200 205 

Ala Glu Lys Glu Trp Val Tyr Asp Ser His Ser Gin Ala Trp Tyr Tyr 
210 215 220 

Phe Lys Ser Gly Gly Tyr Met Thr Ala Asn Glu Trp lie Trp Asp Lys 
225 230 235 240 

Glu Ser Trp Phe Tyr Leu Lys Ser Asp Gly Lys lie Ala Glu Lys Glu 
' 245 250 255 

Trp Val Tyr Asp Ser His Ser Gin Ala Trp Tyr Tyr Phe Lys Ser Gly 
260 ^ 265 270 

Gly Tyr Met Ala Lys Asn Glu Thr Val Asp Gly Tyr Gin Leu Gly Ser 
275 280 285 

Asp Gly Lys Trp Leu Gly Gly Lys Thr Thr Asn Glu Asn Ala Ala Tyr 
. 290 295 300 

Tyr Gin Val Val Pro Val Thr Ala Asn Val Tyr Asp Ser Asp Gly Glu 
305 , 310 315 320 

Lys Leu Ser Tyr lie Ser Gin Gly Ser Val Val Trp Leu Asp Lys Asp 
325 . 330 335 

Arg Lys Ser Asp Asp Lys Arg Leu Ala lie Thr lie Ser Gly Leu Ser 
340 345 350 

Gly Tyr Met Lys Thr Glu Asp Leu Gin Ala Leu Asp Ala Ser Lys Asp 
355 - 360 365 

Phe He Pro Tyr Tyr Glu Ser Asp Gly His Arg Phe Tyr His Tyr Val 
370 375 380 

Ala Gin Asn Ala Ser He Pro Val Ala Ser His Leu Ser Asp Met Glu 
. 385 390 r 395 400 

Val Gly Lys Lys Tyr Tyr Ser Ala Asp Gly Leu His Phe Asp Gly Phe 
405 410 415- 

Lys Leu~Glu Asn Pro Phe Leu Phe Lys Asp Leu Thr Glu Ala Thr Asn 
420 425 430 

Tyr Ser Ala Glu Glu Leu Asp Lys Val Phe Ser Leu Leu Asn lie Asn 
435 ,440 - . 445 

Asn Ser Leu Leu Glu Asn Lys Gly Ala Thr Phe Lys Glu Ala Glu Glu 

450 - _ . 455 — — 4-6-Q— 



His Tyr His He Asn Ala Leu Tyr Leu Leu Ala His Ser Ala Leu Glu 
465^ 470 475 480 
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Ser Asn Trp Gly Arg Ser Lys lie Ala Lys Asp Lys Asn Asn Phe Phe 
485 490 495 

Gly He Thr Ala Tyr Asp Thr Thr Pro Tyr Leu Ser Ala Lys Thr Phe 
500 505 510 

Asp Asp Val Asp Lys Gly He Leu Gly Ala Thr Lys Trp He Lys Glu 
515 520 525 

Asn Tyr He Asp Arg Gly Arg Thr Phe Leu Gly Asn Lys Ala Ser Gly 
530 " 535 540 

^ Met Asn Val Glu Tyr Ala Ser Asp Pro Tyr Trp Gly Glu Lys He Ala 
545 550 555 . 560 

Ser Val Met Met Lys He Asn Glu Lys 
565 

(2.) INFORMATION^. FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 94 6 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
ATTTGCAGAT G ATTC TGAAG GATGGCAGTT TGTCCAAGAA AATGGTAGAA CCTACTACAA 60 

AAAGGGGGAT CTAAAAGAAA CCTACTGGAG AGTGATAGAT GGGAAGTACT ATTATTTTGA 12 0 

TCCTTTATCC GGAGAGATGG TTGTCGGCTG GCAATATATA CCTGCTCCAC ACAAGGGGGT - 18 0 

TAC GATTGGT CCTTCTCCAA GAATAGAGAT TGCTCTTAGA CCAGATTGGT TTTATTTTGG 240 

TCAAGATGGT GTATTACAAG AATT.TGTTGG CAAGCAAGTT TTAGAAGCAA AAACTGCTAC 3 00 

GAATACCAAC AAACATCATG GGGAAGAATA TGATAGCCAA GCAGAGAAAC GAGTCTATTA 3 60 

TTTTGAAGAT CAGCGTAGTT ATCATACTTT AAAAACTGGT TGGATTTATG AAGAGGGTCA 42 0 

- TTGGTATTAT TTACAGAAGG ATGGTGGCTT TGATTCGCGC ATCAACAGAT TGACGGTTGG 48 0 

AGAGCTAGCA CGTGGTTGGG TTAAGGATTA CCCTCTTACG TATGATGAAG AGAAGCTAAA 540 

AGC AGCTC C A TGGTACTATC TAAATCCAGC AACTGGCATT ATGCAAACAG GTTGGCAATA 600 

TCTAGGTAAT AGATGGTACT ACCTCCATTC GTCAGGAGCT ATGGCAACTG GCTGGTATAA - 660 

GGAAGGCTCA ACTTGGTACT ATCTAGATGC TGAAAATGGT GATATGAGAA CTGGCTGGCA 72 0 

AAACCTTGGG • AACAAATGGT ACTATCTCCG TTCATCAGGA GCTATGGCAA CTGGTTGGTA 780 

TCAGGAAAGT TCGACTTGGT ACTATCTAAA^TGCAAGTAAT GGAGATATGA AAACAGGCTG 840 
GTTCCAAGTC AATGGTAACT GGTACTATGC C TATGATTC A GGTGCTTTAG C TGTTAATAC~— — -900 

~CACAGTAGGT GGTTACTACT TAAACTATAA TGGTGAATGG GTTAAG " " 946 
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INFORMATION FOR SEQ ID NO: 15 6: 

(i) SEQUENCE CHARACTERISTICS: 

- . (A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single < 
. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: . 

Val Phe Ala Asp Asp Ser Glu Gly Trp Gin Phe Val Gin Glu Asn Gly 
1 5 10 15 

Arg Thr Tyr Tyr Lys Lys Gly Asp Leu Lys Glu Thr Tyr Trp Arg Val 
20 25 30 

lie Asp Gly Lys Tyr Tyr Tyr Phe Asp Pro Leu Ser Gly Glu Met Val 
3 5 "40 ' 45 

Val Gly Trp Gin Tyr lie Pro Ala Pro His Lys Gly Val Thr lie Gly 
50 55 60 

Pro Ser Pro Arg lie- Glu lie Ala Leu Arg Pro Asp Trp Phe Tyr Phe 
65 70 . 75 80 

Gly Gin Asp Gly Val Leu Gin Glu Phe Val Gly Lys Gin Val Leu Glu 
85 90 95 

Ala Lys Thr Ala Thr Asn Thr Asn Lys His His Gly Glu Glu Tyr Asp 

i .. . „ . 100 105 110 

Ser Gin Ala Glu Lys Arg Val Tyr Tyr Phe Glu Asp Gin Arg Ser Tyr 
115 -• 120 ' 125 

His Thr Leu Lys Thr Gly Trp lie Tyr Glu Glu Gly His Trp Tyr Tyr 
. 130 135 140 

Leu Gin Lys Asp Gly Gly Phe Asp Ser Arg, lie Asn Arg Leu Thr" Val 
145 150 155 .160 

Gly Glu Leu Ala Arg Gly Trp Val Lys Asp Tyr Pro Leu Thr Tyr Asp 
165 170 175 

Glu Glu Lys Leu Lys Ala Ala- Pro Trp Tyr Tyr Leu Asn Pro Ala Thr 
180 185 190 

Gly lie Met Gin Thr Gly Trp Gin Tyr Leu Gly Asn Arg Trp Tyr Tyr 
195 200 205 

Leu His Ser Ser Gly Ala Met Ala Thr Gly Trp" Tyr Lys Glu Gly Ser 
210 215 220^ 

Thr Trp Tyr Tyr Leu Asp Ala Glu Asn Gly Asp Met Arg Thr Gly Trp 
225 230 235 240 

Gin Asn Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met 

245 ... 250 — . 255 

Ala Thr Gly -Trp Tyr Gin Glu Ser Ser Thr Trp Tyr Tyr Leu Asn Ala 
260 _ 265 ' 270 
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Ser Asn Gly Asp Met Lys Thr Gly Trp Phe Gin Val Asn Gly Asn Trp 
275 280 285 

Tyr Tyr Ala Tyr Asp Ser Gly Ala Leu Ala Val Asn Thr .-Thr Val Gly 
290 295 300 

Gly Tyr Tyr Leu Asn Tyr Asn Gly Glu Trp Val Lys 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double ' 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:- 157 : ' """""" " — ™ 

TGTCGCTGCA AATGAAACTG AAGTAGCAAA AACTTC GC AG GAT AC AAC GA CAGCTTCAAG 60 
T AGTT C AG AG CAAAATCAGT CTTCTAATAA AACGCAAACG AGCGCAGAAG TACAGACTAA 120 
TGCTGCTGCC CACTGGGATG GGGATTATTA TGTAAAGGAT GATGGTTC TA AAGCTCAAAG 18 0 

TGAATGGATT TTTGAC AAC T ACTATAAGGC TTGGTTTTAT ATTAATTCAG ATGGTC GTTA 240 
CTCGCAGAAT GAATGGCATG G AAATTAC TA CCTGAAATCA GGTGGATATA TGGCCCAAAA 300 
CGAGTGGATC TATGACAGTA ATTACAAGAG TTGGTTTTAT CTCAAGTCAG ATGGGGCTTA 3 60 

TGCTCATCAA GAATGGCAAT TGATTGGAAA TAAGTGGTAC TACTTCAAGA AGTGGGGTTA 42 0 

CATGGCTAAA AGCCAATGGC AAGGAAGTTA TTTCTTGAAT GGTCAAGGAG CTATGATGCA 48 0 

AAATGAATGG CTSCTATGAT CCAGCCTATT CTGCTTATTT TTATCTAAAA TCCGATGGAA 540 ' 

CTTATGCTAA CCAAGAGTGG CAAAAAGTGG GCGGCAAATG GTACTATTTC AAGAAGTGGG 600 
GCTATATGGC TCGGAATGAG TGGCAAGGCA AC TACT ATTT GACTGGAAGT GGTGCCATGG 660 
CGACTGACGA AGTGATTATG GATGGT AC TC GCTATATCTT TGCGGCCTCT GGTGAGCTCA 72 0 

AAGAAAAAAA AGATTTGAAT GTCGGCTGGG TTCACAGAGA TGGTAAGCGC TATTTCTTTA 78 0 

ATAATAGAGA AGAACAAGTG GGAACCGAAC ATGCTAAGAA AGTCATTGAT ATTAGTGAGC 840 
ACAATGGTCG TATCAATGAT TGGAAAAAGG TTATTGATGA GAACGAAGTG GATGGTGTCA . 900 
TTGTTCGTCT AGGTTATAGC GGTAAAGAAG ACAAGGAATT GGCGCATAAC ATTAAGGAGT 960 

TAAACCGTCT GGGAATTCCT TATGGTGTCT ATCTCTATAC CTATGCTGAA AATGAGACCG 1020 

ATGCTGAGAG TGACGCTAAA CAGACCATTG AACTTATAAA GAAATACAAT ATGAACCTGT 1080 

CTTACCCTAT CTATTATGAT GTTGAGAATT GGGAATATGT AAATAAGAGC AAGAGAGCTC 1140 

. CAAGTGATAC AGGCACTTGG GTTAAAATCA TCAACAAGTA CATGGACACG ATGAAGCAGG 1200.. 

CGGGTTATCA AAATGTGTAT GTC TATAGCT ATCGTAGTTT ATTAC AG AC G CGTTTAAAAC 12 60 
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ACCCAGATAT TTTAAAACAT GTAAACTGGG TAGCGGCCTA TACGAATGCT TTAGAATGGG 
AAAACCCTCA .TTATTCAGGA AAAAAAGGTT GGCAATATAC CTCTTCTGAA TACATGAAAG 
GAATCCAAGG GCGCGTAGAT GTCAGCGTTT GGTAT 
(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids ■ . 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

Val Ala Ala Asn Glu Thr Glu Val Ala Lys Thr Ser Gin Asp Thr Thr~ 
1 5 10 15 

Thr Ala Ser Ser Ser Ser Glu Gin Asn Gin Ser Ser Asn Lys Thr Gin 
20 \ 25 30 

Thr Ser Ala Glu Val Gin Thr Asn Ala Ala Ala His Trp Asp Gly Asp 
35 40 45 

Tyr Tyr Val Lys Asp Asp Gly Ser Lys Ala Gin Ser Glu Trp lie Phe 
50 55 60 

Asp Asn Tyr Tyr Lys Ala Trp Phe Tyr lie Asn Ser .Asp Gly Arg Tyr 
65 70 75 80 

Ser Gin Asn Glu Trp His Gly Asn Tyr Tyr Leu Lys Ser Gly Gly Tyr 
85 90 95 

Met Ala Gin Asn Glu Trp ^Ile Tyr Asp Ser Asn Tyr Lys Ser Trp Phe 
100 105 - 110 

Tyr Leu Lys Ser Asp Gly Ala Tyr Ala His Gin Glu Trp Gin. Leu lie 
115 120 125 

Gly Asn Lys Trp Tyr Tyr Phe Lys Lys Trp Gly Tyr Met Ala Lys Ser 
130 135 140 

Gin Trp Gin Gly Ser Tyr Phe Leu Asn Gly Gin Gly Ala Met Met Gin 
145 150 155 160 

Asn Glu Trp Leu Tyr Asp Pro Ala Tyr Ser Ala Tyr Phe Tyr Leu Lys 
165 170 175 

Ser Asp Gly Thr Tyr Ala Asn Gin Glu Trp Gin Lys Val Gly Gly Lys 
180 185 190" _ 

Trp Tyr Tyr Phe Lys Lys Trp Gly Tyr Met Ala Arg Asn Glu Trp Gin 
•195 • 200 205 

Gly Asn Tyr Tyr Leu Thr Gly Ser Gly Ala Met Ala Thr Asp Glu Val 
210. .215 220 

Ile_ Met Asp Gly Thr Arg Tyr lie Phe Ala Ala Ser Gly Glu Leu Lys 
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225 

Glu Lys Lys Asp 



Tyr Phe Phe Asn 
260 

. Lys Val lie Asp 
275 ' 

Lys Val lie Asp 

. 290 

Tyr Ser Gly Lys 
_ 305 

Asn Arg Leu Gly 



Asn Glu Thr Asp 
340 



230 

Leu Asn Val Gly 
245 

Asn - Arg Glu Glu 



lie Ser Glu His 
280 

Glu Asn Glu Val 
295 

Glu Asp Lys Glu 
310 

lie Pro Tyr Gly 
325 

Ala Glu Ser Asp 



235 

Trp Val His Arg 
250 

Gin Val Gly Thr 
2 65 

Asn Gly Arg lie 



Asp Gly Val lie 
300 

Leu Ala His Asn 
315 

Val Tyr Leu Tyr 
330 

Ala Lys Gin Thr 
345 



240 

Asp Gly Lys Arg 
. 255 

Glu. His Ala Lys. 
270 

Asn Asp Trp Lys 
285 

Val Arg Leu Gly 



lie Lys Glu Leu 
320 

Thr Tyr Ala Glu 
335 

lie Glu Leu lie 
350 



Lys Lys Tyr Asn Met Asn Leu Ser Tyr Pro lie Tyr Tyr Asp Val Glu 
355 360 365 

Asn Trp Glu Tyr Val Asn Lys Ser Lys Arg Ala Pro Ser Asp Thr Gly 
370 375 380 



Thr Trp Val Lys lie lie Asn Lys Tyr Met Asp Thr Met Lys Gin Ala 
385 390 .395 400 

Gly Tyr Gin Asn Val Tyr Val Tyr Ser Tyr. Arg Ser Leu Leu Gin, Thr 
405 410 415 

Arg Leu Lys His Pro Asp lie Leu Lys His Val Asn Trp Val Ala Ala 
420 425 430 

Tyr Thr Asn Ala Leu Glu Trp Glu Asn Pro His Tyr Ser Gly Lys . Lys 
435 440 445 * ' ' 

Gly Trp Gin Tyr Thr Ser Ser Glu Tyr Met Lys Gly lie Gin Gly Arg 
450 455 . 460 

Val Asp Val Ser Val Trp Tyr 

465 1 . 470 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 1924 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
TACGTCTCAG CCTACTTTTG TAAGAGCAGA AGAATCTCCA CAAGTTGTCG AAAAATC TTC 60 
ATTAGAGAAG AAATATGAGG AAGCAAAAGC AAAAGCTGAT ACTGCCAAGA AAGATTACGA 120 
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AACGGCTAAA AAGAAAGCAG AAGACGCTCA GAAAAAGTAT GAAGATGATC AGAAGAGAAC 18 Q 

TGAGGAGAAA GCTCGAAAAG AAGCAGAAGC ATCTCAAAAA TTGAATGATG TGGCGCTTGT . 240- 

TGTTCAAAAT GCATATAAAG AGTACC GAGA AGTTCAAAAT CAACGTAGTA AATATAAATC 3 00 

TGACGCTGAA TATCAGAAAA AATTAACAGA GGTCGACTCT AAAATAGAGA AGGCTAGGAA 3 60 

AGAGCAACAG GACTTGCAAA ATAAATTTAA TGAAGTAAGA GCAGTTGTAG TTC CTGAACC 420 

AAATGC GTTG GCTGAGACTA AGAAAAAAGC AGAAGAAGCT AAAGCAGAAG AAAAAGTAGC 480 

TAAGAGAAAA TATGATTATG CAACTCTAAA GGTAGCACTA GCGAAGAAAG AAGTAGAGGC 540 

TAAGGAACTT G AAATT G AAA AACTTCAATA TGAAATTTCT ACTTTGGAAC AAGAAGTTGC 600 

TACTGCTCAA CATCAAGTAG ATAATTTGAA AAAACTTCTT GCTGGTGCGG ATCCTGATGA 660 

TGGCACAGAA GTTATAGAAG CTAAATTAAA AAAAGGAGAA GCTGAGCTAA AC GC TAAAC A 72 0 

AGC TGAGTT A GCAAAAAAAC AAACAGAACT TGAAAAACTT CTTGACAGCC TTGATCCTGA 78 0 

AGGTAAGACT CAGGATGAAT TAGATAAAGA AGCAGAAGAA GCTGAGTTGG ATAAAAAAGC 84 0 

TGATGAACTT CAAAATAAAG TTGCTGATTT AGAAAAAGAA ATTAGTAACC TTGAAATATT 900 

ACTTGGAGGG GCTGATNCTG AAGATGATAC TGCTGCTCTT CAAAATAAAT TAGCTACTAA 960 

AAAAGCTGAA TTGGAAAAAA CTCAAAAAGA ATTAGATGCA GCTCTTAATG AGTTAGGGCC 102 0 

TGATGGAGAT GAAGAAGAAA CTCCAGCGCC GGCTCCTCAA CCAGAGCAAC CAGCTCCTGC -i080 

ACCAAAACCA GAGCAACCAG CTCCAGCTCC AAAACCAGAG CAACCAGCTC CTGCACCAAA 1140 

ACCAGAGCAA CCAGCTCCAG CTCCAAAACC AGAGCAACCA GCTCCAGCTC CAAAACCAGA 1200 
GCAACCAGCT AAGCC GGAGA AACCAGCTGA AGAGCCTACT C AAC C AG AAA AACCAGCCAC . ' 12 60 

TCCAAAAACA GGCTGGAAAC AAGAAAACGG TATGTGGTAT TTCTACAATA CTGATGGTTC 13 20 

AATGGCAATA GGTTGGCTCC AAAACAACGG TTCATGGTAC TACCTAAACG CTAACGGCGC 13 80 

TATGGCAACA GGTTGGGTGA AAGATGGAGA TACCTGGTAC TATCTTGAAG CATCAGGTGC 1440 

TATGAAAGCA AGCCAATGGT TCAAAGTATC AGATAAATGG TACTATGTCA ACAGCAATGG 1500 

CGC TATGGCG ACAGGCTGGC TCCAATACAA TGGCTCATGG TACTAC CTC A ACGCTAATGG 1560 

TGATATGGCG ACAGGATGGC TCCAATACAA CGGTTCATGG TATTAC CTC A ACGCTAATGG 1620 

TGATATGGC G ACAGGATGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA ACGCTAACGG 1680 

TGCTATGGCT ACAGGTTGGG CTAAAGTCAA CGGTTCATGG TACTACCTAA ACGCTAACGG 1740 

TTCAATGGCA ACAGGTTGGG TGAAAGATGG AGATAC CTGG TACTATCTTG AAGCATCAGG 1800 

TGC TATG AAA GCAAGCCAAT GGTTCAAAGT ATCAGATAAA TGGTACTATG TCAATGGCTT 1860 

AGGTGCCCTT GCAGTCAACA CAACTGTAGA TGGCTATAAA GTCAATGCCA ATGGTGAATG . 1920 

GGTT ~ — : 1924 

(2) INFORMATION FOR SEQ ID NO: 160: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Thr Ser Gin Pro Thr Phe Val Arg Ala Glu Glu Ser Pro Gin Val Val 
1 . - ■ 5 10 15 

Glu Lys Ser Ser Leu Glu Lys Lys Tyr Glu Glu Ala Lys Ala Lys Ala 
20 25 30 

Asp Thr Ala Lys Lys Asp Tyr Glu Thr Ala Lys Lys Lys Ala Glu Asp 
35 40 45 

Ala Gin Lys Lys- Tyr Glu Asp Asp Gin Lys Arg Thr Glu Glu Lys Ala 
50 55 60. 

Arg Lys Glu Ala Glu Ala Ser Gin Lys Leu Asn Asp Val Ala Leu Val 
65 70 75 80 

Val Gin Asn Ala Tyr Lys Glu Tyr Arg Glu Val Gin Asn Gin Arg Ser 
85 90. 95 

Lys Tyr Lys Ser Asp Ala Glu Tyr Gin Lys Lys Leu Thr Glu Val Asp 
100 105 1 110 

Ser Lys lie Glu Lys Ala Arg Lys Glu Gin Gin Asp Leu Gin Asn Lys 
115 120 125 

Phe Asn Glu Val Arg Ala Val Val Val Pro Glu Pro Asn Ala Leu Ala 
130 135 140 

Glu Thr Lys Lys Lys Ala Glu Glu Ala Lys Ala Glu Glu Lys Val Ala 
145 150 155 160 

Lys Arg Lys Tyr Asp Tyr Ala Thr Leu Lys Val Ala Leu Ala Lys Lys 
' 165 170 175 

Glu Val Glu Ala Lys Glu Leu Glu lie Glu Lys Leu Gin Tyr Glu lie 
180 185 190 

Ser Thr Leu Glu Gin Glu Val Ala Thr Ala Gin His Gin Val Asp Asn 
195 200 205 

Leu Lys Lys Leu Leu Ala Gly Ala Asp Pro Asp Asp Gly Thr Glu Val 
210 215 220 

lie Glu Ala Lys Leu Lys Lys Gly Glu Ala Glu Leu Asn Ala Lys Gin 
225 . 230 235 . 240 

Ala Glu Leu Ala Lys Lys Gin Thr Glu Leu Glu Lys Leu Leu Asp Ser 
245 250 _ 255 



Leu Asp Pro Glu 
260 



Gly Lys Thr Gin Asp Glu Leu Asp Lys- Glu Ala Glu 
265 270 
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Glu Ala Glu Leu Asp Lys Lys Ala Asp Glu Leu Gin Asn Lys Val Ala 
275 280 • < 285 

Asp Leu Glu Lys 'Glu lie Ser Asn Leu Glu lie Leu Leu Gly Gly Ala 
290 295 300 

Asp Xaa Glu Asp Asp Thr Ala Ala Leu Gin Asn Lys Leu Ala Thr Lys 
305 310 315 320 

Lys Ala Glu Leu Glu Lys Thr Gin Lys Glu Leu Asp Ala Ala Leu Asn 
325 330 335 

Glu Leu Gly Pro Asp Gly Asp Glu Glu Glu Thr Pro Ala Pro Ala Pro 
340 - 345 350 

Gin Pro Glu Gin Pro Ala Pro Ala Pro Lys Pro Glu Gin Pro Ala Pro 
355 360 365 

Ala Pro Lys Pro Glu Gin Pro Ala Pro Ala Pro Lys Pro Glu Gin Pro 
370 375 380 

Ala Pro Ala Pro Lys Pro Glu Gin Pro Ala Pro Ala Pro Lys Pro Glu 
385 390 395 400 

Gin Pro Ala Lys Pro Glu Lys Pro Ala Glu Glu Pro Thr Gin Pro Glu 
405 410 .415 

Lys Pro Ala Thr Pro Lys Thr Gly Trp Lys Gin Glu Asn Gly Met Trp 
420 425 430 

Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala lie Gly Trp Leu Gin Asn 
435 440 445 

Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly 
450 455 460 

Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala 
465 470 475 480 

Met Lys Ala Ser Gin Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val 
485 490 495 

Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Leu Gin Tyr Asn Gly Ser 
500 505 510 

Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Leu Gin 
515 520 525 

Tyr Asn Gly Ser Trp Tyr Tyr Leu Ash Ala Asn Gly Asp Met Ala Thr 
530 535 540 

Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly 
545 550 555 560 

Ala Met Ala Thr Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu 

. 565 ' "' 570 - 575 

Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Val Lys Asp Gly Asp Thr 

580 " 585 - 590 



Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Lys Ala Ser Gin Trp Phe 
595 600 605. 
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Lys Val Ser Asp Lys- Trp Tyr Tyr Val Asn Gly Leu Gly Ala Leu Ala " 
610 615 620 

Val Asn Thr Thr: Val Asp Gly Tyr Lys Val Asn Ala Asn Gly Glu Trp 
625 — 630 635 640- 

Val 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: ^ 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

TGGACAGGTG AAAGGTCATG CTACATTTGT GAAATCCATG AC AAC TGAAA TGTACCAAGA 60 

ACAACAGAAC CATTCTCTCG CCTACAATCA ACGCTTGGNT TCGCAAAATC GCATTGTAGA 12 0 

TCCTTTTTTG GC GGAGGGAT ATGAGGTCAA TTACCAAGTG TCTGACGACC CTGATGCAGT 180 

CTATGGTTAC TTGTCTATTC CAAGTTTGGA AATCATGGAG CCGGTTTATT TGGGAGCAGA 2 40 

TTATCATCAT TTAGGGATGG GCTTGGCTCA TGTGGATGGT ACACCGCTGC CTCTGGATGG 3 00 

TACAGGGATT CGCTCAGTGA TTGC TGGGC A C C GTGC AGAG CCAAGCCATG TCTTTTTCCG 3 60 

CCATTTGGAT CAGCTAAAAG TTGGAGATGC TCTTTATTAT GAT AATGGC C AGGAAATTGT 420 

AGAATATCAG ATGATGGACA CAGAGATTAT TTTACCGTCG GAATGGGAAA AATTAGAATC 480 

- GGTTAGCTCT AAAAATATCA TG AC CTTGAT AACCTGCGAT CCGATTCCTA C CTTTAATAA 540 

ACGCTTATTA GTGAATTTTG AACGAGTCGC TGTTTATCAA AAATCAGATC CACAAACAGC 600 

TGCAGTTGCG AGGGTTGCTT TTACGAAAGA AGGACAATCT GTATCGCGTG TTGCAACCTC 660 

TCAATGGTTG 670 



(2) INFORMATION FOR SEQ ID NO:162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single. 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Gly Gin Val Lys Gly His Ala Thr Phe Val Lys Ser Met Thr Thr Glu 

— 1 , 5 10 • — 15 

Met Tyr Gin Glu Gin Gin Asn His Ser Leu Ala Tyr Asn Gin Arg Leu 
20 25 30 
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Xaa Ser Gin Asn Arg lie Val Asp Pro Phe Leu Ala Glu Gly Tyr Glu 
35 40 45 

Val Asn Tyr Gin Val Ser Asp Asp Pro Asp Ala Val Tyr Gly Tyr Leu 
50 55 60 

Ser lie Pro Ser Leu Glu lie Met Glu Pro Val Tyr Leu Gly Ala Asp 
65 70 75 80 

Tyr His His Leu Gly Met Gly Leu Ala His Val Asp Gly Thr Pro Leu 
85 • '90. 95 

Pro Leu Asp Gly Thr Gly lie Arg Ser Val lie Ala Gly His Arg Ala 
.100 105 110 

Glu Pro Ser His Val Phe Phe Arg His Leu Asp Gin Leu Lys Val Gly 
115 12.0 125 

Asp Ala Leu Tyr Tyr Asp Asn Gly Gin Glu lie Val ' Glu Tyr Gin Met 

13 0 " " ' 13 5" " 14 0 

Met Asp Thr Glu lie lie Leu Pro Ser Glu Trp Glu Lys Leu Glu Ser 
145 150 155 160 

Val Ser Ser Lys Asn lie Met Thr Leu lie Thr Cys Asp Pro lie Pro 
165 170 175 

Thr Phe Asn Lys Arg Leu Leu Val Asn Phe Glu Arg Val Ala Val Tyr 
180 „ 185 190 

Gin Lys Ser Asp Pro Gin Thr Ala Ala Val Ala Arg Val Ala Phe Thr 
195 200 205 

Lys Glu Gly Gin Ser Val Ser Arg Val Ala Thr Ser Gin Trp Leu 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : linear ; 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

GATTGCTCCT TTGAAGGATT TGAGAGAAAC CATGTTGGAA ATTGCTTCTG GTGCTCAAAA 

TCTTCGTGCC AAGGAAGTTG GTGCCTATGA AC TG AG AG AA GTAACTCGCC AATTTAATGC 

TATGTTGGAT CAGATTGATC AGTTGATGGT AGCTATTCGT AGCCAGGAAG AAACGACCCG 

TCAGTACCAA CTTCAAGCCC TTTCGAGCCA GATTAATCCA CATTTCCTCT ATAACACTTT 

GGAC AC CATC ATCTGGATGG CTGAATTTCA TGATAGTCAG CGAGTGGTGC AGGTGACCAA 

GTCCTTGGCA ACCTATTTCC GCTTGGCGCT CAATCAAGGC AAGGACTTGA TTTGTCTGTC— 

TGACGAAATC AATCATGTCC GCCAGTATCT CTTTATCCAG AAACAACGCT ATGGAGATAA 



Iff! 
4- i 
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GCTGGAATAC GAAATTAATG AAAATGTTGC CTTTGATAAT TTAGTCTTAC CCAAGCTGGT 48 0 

CCTACAACCC CTTGTAGAAA ATGCTCTTTA CCATGGCATT AAGGAAAAGG AAGGTC AGGG 540 
C CAT ATT AAA CTTTCTGTCC AGAAACAGGA TTC GGGATTG GTCATCCGTA TTGAGGATGA - 600 

TGGCGTTGGC TTCCAAGATG. CTGGTGATAG TAGTCAAAGT CAACTCAAAC GTGGGGGAGT 660 

TGGTCTTCAA AATGTC GATC AACGGCTCAA ACTTCATTTT GGAGCCAATT AC C ATATGAA 720 

- GATTGATTCT AGACCCCAAA AAGGGACGAA AGTTGAAATA TATATAAATA GAATAGAAAC 78 0 

TAGC „ . v ■ 784 
(2) INFORMATION FOR SEQ ID NO : 164 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 61 amino acids 

(B) TYPE: amino acid 

'(C) STRANDEDNESS : single " """" " " 

(D) TOPOLOGY: linear """ ~~~ ' 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 



iUi. He Ala Pro Leu Lys Asp Leu Arg Glu Thr Met Leu Glu He Ala Ser 

I 1 5 10 15 

r! Gly Ala Gin Asn Leu Arg Ala Lys Glu Val Gly Ala Tyr Glu Leu Arg 

P 20 25 -~ 30 

1U Glu- Val Thr Arg Gin Phe Asn Ala Met Leu Asp Gin He Asp Gin Leu 

O ' 35 40 45 

Met Val Ala He Arg Ser Gin Glu Glu Thr Thr Arg Gin Tyr Gin Leu 

50- 55 , 60 

Gin Ala Leu Ser Ser Gin He Asn Pro His Phe Leu Tyr Asn Thr Leu 

65 70 . 75 80 

Asp Thr. He He Trp Met Ala Glu Phe His Asp Ser Gin Arg Val Val 
85 90 " 95 

- Gin Val Thr Lys Ser Leu Ala Thr Tyr Phe Arg Leu Ala Leu Asn Gin 
100 105 ■* 110 ■ 

Gly Lys Asp Leu He Cys Leu Ser Asp Glu He Asn His Val Arg Gin 
115 120 125 

Tyr Leu Phe He Gin Lys Gin Arg Tyr Gly Asp Lys Leu Glu Tyr Glu 
130 135 140 

He Asn Glu Asn Val Ala Phe Asp Asn Leu Val Leu Pro Lys Leu Val 

145 . 150 155 ; 160 

Leu Gin Pro Leu Val Glu Asn Ala Leu Tyr His" Gly Ile Lys Glu Lys 
165 170 _ " 175 

Glu Gly Gin Gly His He Lys Leu Ser -Val -Gin Lys Gin Asp Ser Gly 

180 • 185 190 ■ 
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Leu Val lie Arg lie Glu Asp Asp Gly Val Gly Phe Gin Asp Ala Gly 
195 200 205 

Asp Ser Ser Gin Ser Gin Leu Lys Arg Gly Gly Val Gly Leu Gin Asn 
210 - 215 220 

Val Asp Gin Arg Leu Lys Leu His Phe Gly Ala Asn. Tyr His Met Lys 
225 230 235 240 

lie Asp Ser Arg Pro Gin Lys Gly Thr Lys Val Glu lie Tyr lie Asn 
245 250 255 . 

Arg lie Glu Thr Ser 
260 

(2) INFORMATION FOR SEQ ID NO : 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 
'(B) TYPE: nucleic acid _ 

(C) STRANDEDNESS : double - _ _ 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 
TAGGTCATAT GGGACTTTTT TTCTACAACA AAATAGGCTC CATAATATCT ATAAGGGATT 
TACCCACTAC AAATATTATA GAGCCGAAAA TTCACATCTA ATATATGCAG ACTACTTTGA 
AATGAAATTA AAAAAATTAT TAAAGGATGA CACAAAAGTT TTTGAAAAAT CTACATTCAA ' 
ATTTGTAGAA GGATATAAAA TATACC TGAC AG AATC T AAA GAATCTGGAA TTAAACAAAT 
GGACAATGTC ATAAAATATT TTGAGTTTAT TGAATC TAAA AGTATTGCTT TATATTTTCA 
AAAACGATTA AATGAGCTGA TAGAT ' 
(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:166: 

Arg Ser Tyr Gly Thr Phe Phe Leu Gin Gin Asn Arg Leu His Asn lie 
1 5 - 10 15 



Tyr Lys Gly Phe Thr His Tyr Lys Tyr Tyr Arg Ala Glu Asn Ser His 

20 25 30 

Leu lie Tyr Ala Asp Tyr Phe Glu Met Lys Leu Lys Lys Leu Leu Lys 
35 40 . 45 

Asp Asp Thr Lys Val Phe Glu Lys Ser" Thr Phe Lys Phe Val Glu Gly 

50 55 . ...60 
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Tyr Lys He Tyr Leu Thr Glu Ser Lys Glu Ser Gly He Lys Gin Met 
65 70 75 80 

Asp Asn Val lie Lys Tyr Phe Glu Phe He Glu Ser Lys Ser He Ala 
85 90 95 

Leu Tyr Phe "Gin Lys Arg Leu Asn Glu Leu He Asp 
100 105 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

CAACGTTGAG AATTATTTGC GAATGTGTTT GGATAGCATT CAGAATCAGA CGTATCAAAA 60 

TTTTGAG TGT TTATTAATCA ATGATGGCTC TCCAGATCAT TCATCCAAAA TATGTGAAGA 12 0 

ATTTGTAGAG AAAGATTCTC GTTTCAAATA TTTTGAGAAA GCAAACGGCG GTCTTTCATC 18 0 

AGCTCGTAAC CTAGGTATTG AATGTTCGGG GGGGGGC GTA CATTACTTTT GTAGACTC 23 8 
(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: . 

Asn Val Glu Asn. Tyr Leu Arg Met Cys Leu Asp Ser He Gin Asn Gin 
1 5 10 " 15 

Thr Tyr Gin Asn Phe Glu Cys Leu Leu He Asn Asp Gly Ser Pro Asp 
20 25 .30 

His Ser Ser Lys He Cys Glu Glu Phe Val Glu Lys Asp Ser Arg Phe 
35 .40 45 

. Lys Tyr Phe Glu Lys Ala Asn Gly Gly Leu Ser Ser Ala Arg Asn Leu 
50 55 60 

Gly lie Glu Cys Ser Gly Gly Gly Val His Tyr Phe Cys Arg Leu 

.65 70 75. . 

(2) INFORMATION FOR SEQ ID NO: 169: 



(i) SEQUENCE CHARACTERISTICS :. .. 

(A) LENGTH: 742 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi)- SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

CTACTATCAA TCAAGTTCTT CAGCCATTGA GGCCACCATT GAGGGCAACA GCCAAACGAC 60 

CATCAGCCAG ACTAGCCACT TTATTCAGTC TTATATCAAA AAACTAGAAA CCACCTCGAC 120 

TGGTTTG AC C CAGCAGACGG ATGTTCTGGC CTATGCTGAG AATCCCAGTC AAGACAAGGT 180 

CGAGGGAATC CGAGATTTGT TTTTGACCAT CTTGAAGTCA GATAAGGACT TGAAAACTGT 240 

TGTGCTGGTG ACCAAATCTG GTCAGGTCAT TTCTACAGAT GACAGTGTGC AGATGAAAAC 3 00 

TTCCTCTGAT ATGATGGCTG AGGATTGGTA CCAAAAGGCC ATTCATCAGG GAGCTATGCC 3 60 

TGTTTTGACT CCAGCTCGTA AATCAGATAG TCAGTGGGTC ATTTCTGTCA CTCAAGAACT 42 0 

TGTTGATGCA AAGGGAGCCA ATC TTGGTGT GCTTCGTTTG GATATTTCTT ATGAAACTCT 48 0 

GGAAGCC TAT CTCAATCAAC TCCAGTTGGG GCAGCAGGGC TTTGCCTTCA TTATCAATGA 54 0 

AAACCATGAA TTTGTCTACC ATCCTCAACA CACAGTTTAT AGTTCGTCTA GCAAAATGGA 600 

GGCTATGAAA CCCTACATCG ATACAGGTCA GGGTTATACT CCTGGTCACA AATCCTACGT 660 

CAGTCAAGAG AAGATTGCAG GAACTGATTG GACGGTGCTT GGCGTGTCAT CATTGGAAAA 72 0 

GTTAGAC C AG GTTCGGAGTC AG 742 



(2)' INFORMATION FOR SEQ ID NO: 170: ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids - 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ' ID NO:170: 

Tyr Tyr Gin Ser Ser Ser Ser Ala lie Glu Ala Thr lie Glu Gly Ash 
1 5 10 15 

Ser Gin Thr Thr lie Ser Gin Thr Ser His Phe lie Gin Ser Tyr lie 
20 25 30 

Lys Lys Leu Glu Thr Thr Ser Thr Gly Leu Thr Gin Gin Thr Asp Val 
35 • 40 45 

Leu Ala Tyr Ala Glu Asn Pro Ser Gin Asp Lys Val Glu Gly lie Arg' 
50 55 60 

Asp Leu Phe Leu Thr lie Leu Lys Ser Asp Lys Asp . Leu Lys Thr Val 

65 70 = * . 75' ' ■ 80 

Val Leu Val Thr Lys Ser Gly Gin Val lie Ser Thr Asp Asp Ser Val 
85 90 95 
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Gin Met Lys Thr Ser Ser Asp Met 
100 

Ala lie His Gin Gly Ala Met Pro 
115 120 



Met Ala Glu Asp Trp Tyr Gin Lys 
105 110 

Val Leu Thr Pro Ala Arg Lys, Ser . 
125 



Asp Ser Gin Trp Val lie Ser Val Thr Gin Glu Leu Val Asp Ala Lys 

130 135 140 

Gly Ala Asri Leu Gly Val Leu Arg Leu Asp lie Ser Tyr Glu Thr Leu 
145 150 155 ■ 160 

Glu Ala Tyr Leu Asn Gin Leu Gin Leu Gly Gin Gin Gly Phe Ala Phe 
*165 170 175 

lie lie Asn Glu Asn His Glu- Phe Val Tyr His Pro Gin His Thr Val 
180 185 190 



■Tyr Ser Ser Ser Ser Lys "Met Glu Ala Met Lys Pro Tyr lie Asp Thr 
- 195 - 200 205 

Gly Gin Gly Tyr Thr Pro Gly His Lys Ser Tyr Val Ser Gin Glu Lys 
210 ' 215 220 

lie Ala Gly Thr Asp Trp Thr Val Leu Gly Val Ser Ser Leu Glu Lys 

225 230 235 240 

Leu Asp Gin Val Arg Ser Gin 
245 

(2) INFORMATION FOR SEQ ID NO: 171: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

GACAAAAACA TTAAAAC GTC CTGAGGTTTT ATCACCTGCA GGGACTTTAG AGAAGCTAAA 60 

GGTAGCTGTT CAGTATGGAG CAGATGCTGT CTTTATCGGT GGTCAGGCCT ATGGTCTTCG 120 

TAGCCGTGCG GGAAACTTTA CTTTCGAACA GATGGAAGAA GGCGTGCAGT TTGCGGCCAA" 18 0 

GTATGGTGCC AAGGTCTATG TAGCGGCTAA TATGGTTATG CACGAAGGAA ATGAAGCTGG 240 

TGCTGGTGAG TGGTTCCGTA AACTGCGTGA TATCGGGATT GCAGCAGTTA TCGTATCTGA 3 00 

CCCAGCCTTG ATTATGATTG CAGTGACTGA AGCACCAGGC CTTGAAATCC ACCTTTCTAC 3 60 

CCAAGCCAGT GCCACTAACT ATGAAACCCT TGAGTTCTGG AAAGAGCTAG GCTTGACTCG 420 

TGTCGTTTTA GCGCGTGAGG TTTCAATGGA AGAATTAGCT GAGATCCGCA AACGTACAGA 480 

TGTTGAAATT GAAGC CTTTG TC C ATGGAGC TATGTGTATT— TGA-TAGTGTG GACGTTGTAC 540 

TCTTTCAAAC CACATGAGTA TGCGTGATGC CAACCGTGGT GGATGTTCTC AGTCATGCCG 600 
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TTGGAAATAC GACCTTTACG ATATGCCATT TGGGAAAGAA CGTAAGAGTT TGCAGGGTGA 660 

GATTCCAGAA GAATTTTCAA TGTCAGCCGT TGACATGTCT ATGATTG AC C ANATTC C AG A 720 

TATGATTGAA AATGGTGTGG ACAGTCTAAA AATCGAAGGA CGTATGNAGT CTATTCACTA 780 

NGTATCAACA GTAACCAACT GCTACAAGGC . GGCTGTGGAT GCCTATCTTG .AAAGTCCTGA ■ 840 

AAAGTTTGAA GCTATCAAAC AAG AC TTGGT GGACGAGATG TGGAAGGTTG CCCAACGTGA ' 900 

AC TGGCTAC A GGATTTTACT ATGGTACACC ATCTGAAAAT GAGCAGTTGT TTGGTGCTCG 960 

TCGTAAAATC C CTGAGT AC A AGTTTGTCGC TGAAGTGGTT TC TTATGATG ATGCGGCACA 102 0 

AACAGCAACT ATTCGTCAAC GAAAC GTC AT T AAC GAAGGG GACCAAGTTG AGTTTTATGG 1080 

TCCAGGTTTC CGTCATTTTG AAACCTATAT TGAAGATTTG CATGATGCTA AAGGCAATAA 1140 

AATCGACCGC GCTCCAAATC CAATGGAACT ATTGACTATT AAAGTCCCAC AACCTGTTCA 12 00 

ATCAGGAGAC ATGGTTCGAG CTCT TAAAGA~ GGGGCTTATC AATCTTTATA AGGAAGATGG 12 60 

AACCAGCGTC ACAGTTCGTG CT 12 8 2 



(2) INFORMATION FOR SEQ ID NO:172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: protein 



(xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

Leu Ser Pro Ala Gly Thr Leu 
10 15 

Gly Ala Asp Ala Val Phe lie 
30 

Arg Ala Gly Asn Phe Thr Phe 
45 

Ala Ala Lys Tyr Gly Ala Lys 
60 

His Glu Gly Asn Glu Ala Gly 
75 . 80 

Asp He Gly He Ala Ala Val 
90 95 

He Ala Val Thr Glu Ala Pro 
110 

Ala Ser Ala Thr Asn Tyr Glu 

125 : : 

Leu Thr Arg Val Val Leu Ala 
140 



Thr Lys Thr Leu Lys Arg Pro Glu Val 
1 5 

Glu Lys Leu Lys Val Ala Val Gin Tyr 
20 25 

Gly Gly Gin Ala Tyr Gly Leu Arg Ser 
35 40 . 

Glu Gin Met Glu Glu Gly Val Gin Phe 
50 \ .55 

Val Tyr Val Ala Ala Asn Met. Val Met 
65 - 70 

Ala Gly Glu Trp Phe Arg Lys Leu Arg 

He Val Ser Asp Pro Ala Leu He Met 
100 105 

Gly Leu Glu He His Leu Ser Thr Gin 

- 115 ... . - 120 

Thr Leu Glu Phe Trp Lys Glu Leu Gly 
130 " 135 
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Arg Glu Val Ser Met Glu Glu Leu Ala Glu lie Arg Lys Arg Thr Asp 
145 150 155 - 160 

Val Glu lie Glu Ala Phe Val His Gly Ala Met Cys lie Ser Tyr Ser 
165 170 175 

Gly Arg Cys Thr Leu Ser Asn His Met Ser Met Arg Asp Ala Asn Arg 
180 185 190 

Gly Gly Cys Ser Gin Ser Cys Arg Trp Lys Tyr Asp Leu Tyr Asp Met 
195 200 205 

Pro Phe Gly Lys Glu Arg Lys Ser Leu Gin Gly Glu lie Pro Glu Glu 
210 215 220 

Phe Ser Met Ser Ala Val Asp Met Ser Met lie Asp. Xaa lie Pro Asp 
225 230 235 240 

Met lie Glu Asn Gly Val^Asp Ser Leu Lys lie Glu Gly Arg Met Xaa 
245 " 250 " 255 

Ser lie His Xaa Val Ser Thr Val Thr Asn Cys Tyr Lys Ala Ala Val 
260 265 270 

Asp Ala Tyr Leu Glu Ser Pro Glu Lys Phe Glu Ala lie Lys Gin Asp 
275 280 285 

Leu Val Asp Glu Met Trp Lys Val Ala Gin Arg Glu Leu Ala Thr Gly 
290 295 300 

Phe Tyr Tyr Gly Thr Pro Ser Glu Asn Glu Gin Leu Phe Gly Ala Arg 
.305 310 , 315 320 

Arg Lys lie Pro Glu Tyr Lys . Phe Val Ala Glu Val Val Ser Tyr Asp 
325 330 335 

Asp Ala Ala Gin Thr Ala Thr lie Arg Gin Arg Asn Val He Asn Glu 
340 345 350 

Gly Asp Gin Val Glu Phe Tyr Gly Pro Gly Phe Arg His Phe Glu Thr 
355 360 365 

Tyr He Glu Asp Leu His Asp Ala Lys Gly Asn Lys He Asp Arg Ala 
370 * 375 380 

Pro Asn Pro Met Glu Leu Leu Thr lie Lys Val Pro Gin Pro Val Gin 
385 390 395 . 400 

Ser Gly Asp Met Val Arg Ala Leu Lys Glu Gly Leu He Asn Leu Tyr 
405 410 - . . 415 

Lys Glu Asp Gly Thr Ser Val Thr Val Arg 'Ala 
420 _ 425 

(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS : . 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid .__ . 

(C) STRANDEDNESS: double — 

(D) - TOPOLOGY: linear 



# 
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{xi> SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

TTCTCAGGAG ACCTTTAAAA ATATCACCAA TAGCTTCTCC ATGCAAATCA -ATCGTCGCGT 60 

CAACCAAGGA ACGCCTCGTG GTGCTGGGAA TATCAAGGGT_GAAGACATCA AAAAAATCAC . 120 

CGAAAACAAG GCCATTGAGT CTTATGTCAA ACGTATCAAC GCTATCGGAG ATTTGACTGG 18 0 

ATATGAC CTG ATTGAAAC GC CAGAAACCAA GAAGAATCTC ACTGCTGATC GTGCCAAGCG 24 0 

TTTTGGAAGT AGCTTGATGA TTACAGGTGT CAATGACTCC TCTAAAGAAG ACAAGTTTGT 3 00 

CTCTGGTTCT TATAAACTAG TCGAAGGAGA GCACTTAACC AACGACGACA AGGATAAAAT 3 60 

CCTCTTGCAC AAGGAC TTGG CAGCCAAACA CGGCTGGAAA GTAGGGGACA AGGTTAAACT 420 

GGACTCTAAT ATCTACGATG CAGATAATGA AAAAGGAGCC AAGGAAACAG TTGAAGTGAC 480 

AATCAAGGGA CTCTTTGATG GTCATAATAA GTCAGCAGTA ACCTACTCAC AAGAACTTTA 540 

CGAAAACACA GCTATTACAG ACATTCACAC TGCTGCAAAA CTTTATGGAT ACACAGAAGA 600 

CACAGCCATT TATGGGGACG CAACCTTCTT TGTAACAGCA GACAAGAACT TGGATGATGT 660 

TATGAAAGAG TTGAATGGCA TCAGTGGTAT C AAC TGGAAG AGCTACACAC TCGTCAAGXG 72 0 

CTCCTCTAAC TACCCAGCTC TTGAGCAATC TATCTCTGGT ATGTACAAGA TGGCCAAC 778 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

■ (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

Ser Gin Glu Thr Phe Lys Asn lie Thr Asn Ser Phe Ser Met Gin lie 

1 '5 10 15 - 

Asn Arg Arg Val Asn Gin Gly Thr Pro Arg Gly Ala Gly Asn lie Lys 
20 25 30 

Gly Glu Asp lie Lys Lys lie Thr Glu Asn Lys Ala lie Glu Ser Tyr 
35 40 45 

Val Lys Arg lie Asn Ala lie Gly Asp Leu Thr Gly Tyr Asp' Leu lie 
50 .55 60 

Glu Thr Pro Glu Thr Lys Lys Asn Leu Thr Ala Asp Arg Ala Lys Arg 
65 70 75 80 

Phe Gly Ser Ser Leu Met lie Thr Gly Val Asn Asp Ser Ser Lys Glu 

85 90 -95 - 



Asp Lys Phe Val Ser Gly Ser Tyr Lys Leu Val" Glu Gly Glu His Leu 
100 ' 10'5 ... 110 
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Thr Asn Asp Asp Lys Asp Lys lie Leu Leu His Lys Asp Leu Ala Ala 
115 120 125 

Lys His Gly Trp Lys Val Gly Asp Lys Val Lys Leu Asp Ser Asn lie 
130 135 140 

Tyr Asp Ala Asp Asn Glu Lys Gly Ala Lys Glu Thr Val Glu Val Thr 
145 150 155 160 

lie Lys Gly Leu Phe Asp Gly His Asn Lys Ser Ala Val Thr Tyr Ser 
165 170 175 

Gin Glu Leu Tyr Glu Asn Thr Ala lie Thr Asp lie His Thr Ala Ala 
180 185 190 

Lys Leu Tyr Gly Tyr Thr Glu Asp Thr Ala lie Tyr Gly Asp Ala Thr 
195 200. ' 205 

Phe Phe Val Thr Ala Asp Lys Asn Leu Asp Asp Val Met Lys Glu Leu 

210 215 220 '" """ 

Asn Gly lie Ser Gly lie Asn Trp Lys Ser Tyr Thr Leu Val Lys Ser 
225 230 235 240 

Ser Ser Asn Tyr Pro Ala Leu Glu Gin Ser lie Ser Gly Met Tyr Lys 
245 , 250 255 

Met Ala Asn 

(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 base pairs 

(B) TYPE: nucleic acid 

(C) - STRANDEDNESS :- double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

AGTAAATGCG CAATCAAATT CATTAATATT AATAGATGAA CCTGAAATCT CACTTCATCC 60 

GAGTGCAATC TATAAATTTA AAGAGTTTTT ACTTCAAGAG TGTTTAAATA AAAAACATCA- , 12 0 

AATTATTATC ACTACACATT CTACACAACT TATAAAAGAT TTTC CTAGAG AAGC CGTGAA 18 0 

ACTTTTAGTG AAAAACGGAG AAAAGGTAGA TGTTATTGAA AATATTGATT ATCAGGATGC 240 

ATTTTTTGAA TTAGGTGATG TGTATCATTC TAGGAAGATG ATTTATGTTG AAGATAGACT 3 00 

AGCTAAATAT ATTCTAGAGT TTGTTATCAC TCATTCAGGT AGTGAGAATC TTAAACAGAA 3 60 

TTTAGTAGTG AGATATATTC CTGGTGGAGC AAATCAAATA ATTTGTAATA ATATTTTAAA 420 

CTCATCGTAT TTAGATTCCG ATAACCATTA TTTTTGGCTT GATGGAGATC AAAACACTAA -480 

TGTTAGTGAA • TCAAATAATT TAATGAACTA TCTTGAAAAT GGTGTTGTTA TATCAGATAA 540 

AATTCCTGAA TCAGATAATA AAAATCTTGA TGATATTATA AAATTGATAA NGGGATGTCC 600 
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AATTAAATTT AATGTTTCAG GTAATAAAGG GCAAAAAAAT AATATTGAAT TAATTGCGAA 66 0 

ACAAAGAAGC TTTATAGATT ATTGGGCTAA ATAC 694 
(2) INFORMATION FOR SEQ ID NO:176 : - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

Val Asn Ala" Gin Ser Asn Ser Leu lie Leu lie Asp Glu Pro Glu lie 
1 5 10-15 

Ser Leu His Pro Ser Ala lie Tyr Lys Phe Lys Glu Phe Leu Leu Gin 
20 25 30 

Glu Cys Leu Asn Lys Lys His Gin lie lie lie Thr Thr His Ser Thr 
35 40 45 

Gin Leu lie Lys Asp Phe Pro Arg Glu Ala Val Lys Leu Leu Val Lys 
50 55 60 

Asn Gly Glu Lys Val Asp Val lie Glu Asn lie Asp Tyr Gin Asp Ala 
65 70 75 80 

Phe Phe Glu Leu Gly Asp Val Tyr His Ser Arg Lys Met lie Tyr. Val 
85 90 95 

Glu Asp Arg Leu Ala Lys Tyr lie Leu Glu Phe Val lie Thr His Ser 
100 ' 105 110 

Gly Ser Glu Asn Leu Lys Gin Asn Leu Val Val Arg Tyr lie Pro Gly 
115 120 125 

Gly Ala Asn Gin lie lie Cys Asn Asn- lie Leu Asn Ser Ser Tyr Leu 
130 135 140 

Asp Ser Asp Asn His Tyr Phe Trp Leu Asp Gly Asp Gin Asn Thr Asn 
145 150 155 160 

Val Ser Glu Ser Asn Asn Leu Met Asn Tyr Leu Glu Asn Gly Val Val 
165 ■ 170 175 

lie Ser Asp Lys lie Pro Glu Ser Asp Asn Lys Asn Leu Asp Asp lie 
180 185 190 

lie Lys Leu lie Xaa Gly Cys Pro lie Lys Phe Asn Val Ser Gly Asn 
195 20Q 205 

Lys Gly Gin Lys Asn. Asn lie Glu Leu lie Ala Lys Gin Arg Ser Phe 
210 215. 220 

lie Asp Tyr Trp. Ala Lys Tyr 

- 225 .230 



(2) INFORMATION FOR SEQ ID NO: 177: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid' 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

TTACCGCGTT CATCAAGATG" TCAAACAAGT CATGACCTAT CAACCCATGG TGCGAGAAAT 60. 

ATTGAGTGAA CAAGACACCC CAGCAAACGA AGAGCTTGTG CTTGCTATGA TTTATACTGA 120 

AACAAAAGGA AAAGAAGGCG ATGTTATGCA GTCTAGTGAG TCTGCAAGTG GTTCCACCAA 180 

CACCATCAAT GATAATGCCT CTAGCATTCG GCAAGGCATT CAAACTCTGA CAGGCAATCT 240 

CTATCTGGCG CAGAAGAAGG GGGTAGATAT CTGGACAGCT GTTCAAGCCT ATAATTTTGG 300 

ACCTGCCTAT ATC GATTTT A TCGCCCAAAA TGGCAAGGAA AATACCCTGG CTCTAGCCAA 3 60 

ACAGTACTCT CGTGAGACTG TTGCCCCCTT GCTTGGTAAT AGGACTGGAA AGACTTATAG 42 0 

TTATATTCAC CCCATTTCCA TTTTTC AC GG TGCTGAACTC TATGTAAATG GAGGAAACTA . 48 0 

TTATTATTCT AGACAGGTAC GACTTAACCT TTACATCATC AAATGTTTCA CTCTCTTTTC 540 
AACATCTGGC ' 550 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

.Tyr Arg Val His Gin Asp Val Lys Gin Val Met Thr Tyr Gin Pro Met 
1 5 10 15 

Val Arg Glu lie Leu Ser Glu Gin Asp Thr Pro Ala Asn Glu Glu Leu 
20 25 30 

Val Leu "Ala Met He Tyr Thr Glu Thr Lys Gly Lys Glu Gly Asp Val 
35 40 45 

Met Gin Ser Ser Glu Ser Ala Ser Gly Ser Thr Asn Thr He Asn Asp 
50 55 60 

Asn Ala Ser Ser He Arg Gin Gly He Gin Thr Leu Thr Gly Asn Leu 
65 ' ' * 70 75 80 



-Tyr Leu Ala- Gin Lys Lys Gly Val Asp He Trp Thr Ala Val Gin Ala 

85 • " . 90 95 . 

Tyr Asn Phe Gly Pro Ala Tyr lie Asp Phe He Ala Gin Asn Gly Lys 
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100 105 110 

Glu A'sn Thr Leu Ala Leu Ala Lys Gin Tyr Ser Arg Glu Thr Val Ala 
115 120 125 

Pro Leu Leu Gly Asn Arg Thr Gly Lys Thr Tyr Ser Tyr lie His Pro 
130 135 140 

He Ser He Phe His Gly Ala Glu Leu Tyr Val Asn Gly Gly Asn Tyr 
145 150 155. 160 

- - Tyr Tyr Ser Arg Gin Val Arg Leu Asn Leu Tyr He lie Lys Cys Phe 

165 170 175 

Thr Leu Phe Ser Thr Ser Gly 
180 

(2) INFORMATION FOR SEQ ID NO: 17 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

GTGGATGGGC TTTAACTATC TTCGTATTCG CCGTGCGGCT AAAATTGTGG ACAATGAGGA 60 

GTTTGAAGCC TTGATTCGTA CGGGTCAATT GATTGATTTG CGCGACCCAG CAGAATTCCA 12 0 

CAGAAAACAT ATCCTTGGTG CACGCAATAT TCCTTCAAGT CAGTTGAAAA CTAGTCTTGC 18 0. 

AGCCCTTCGT AAAGATAAAC CTGTCCTTCT CTACGAAAAC CAACGTGCGC AACGAGTTAC 240 

AAATGCAGCT CTTTACTTGA AAAAACAAGG TTTTTC TG AG ATTTATATCC TTTCTTATGG 3 00 

CTTGGATTCT TGGAAAGGGA AAGTGAAGAC TAGC 3 34 
(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

Trp Met Gly Phe Asn Tyr Leu Arg He Arg Arg Ala Ala Lys He Val 
1 5 - 10 15 

Asp Asn Glu Glu Phe Glu Ala Leu He Arg Thr Gly Gin Leu He Asp 

20 25 . 30 , 



Leu Arg Asp Pro Ala Glu Phe' His Arg Lys His lie Leu Gly Ala Arg 
35- 40 45 
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Asn He Pro Ser Ser Gin Leu Lys Thr Ser Leu Ala Ala Leu Arg- Lys 
50 55 60 

Asp Lys Pro Val Leu Leu Tyr Glu Asn Gin Arg Ala Gin Arg Val Thr 
65 '70 75 80 

Asn Ala Ala Leu Tyr Leu Lys Lys Gin Gly Phe Ser Glu He Tyr He 
85 90 95 

Leu Ser Tyr Gly Leu Asp Ser Trp Lys Gly Lys Val Lys Thr Ser 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 181: 

( i ) SEQUENCE CHARACTERISTICS : . 

(A) LENGTH: 1342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 181: 

AC T AAAC C AG CATCGTTCGC AGGAAAATAA GGACAATAAT CGTGTCTCTT ATGTGGATGG 60 

CAGCCAGTCA AGTCAGAAAA GTGAAAACTT G AC AC C AGAC CAGGTTAGCC AGAAAGAAGG 12 0 

AATTCAGGCT GAGCAAATTG TAATCAAAAT TACAGATCAG GGCTATGTAA CGTCACACGG 180 

TGACCACTAT CATTACTATA ATGGGAAAGT TCCTTATGAT GCCCTCTTTA GTGAAGAACT 240 

CTTGATGAAG GATCCAAACT ATCAACTTAA AGACGCTGAT ATTGTCAATG AAGTCAAGGG 3 00 

TGGTTATATC ATCAAGGTCG ATGGAAAATA TTATGTCTAC CTGAAAGATG CAGCTCATGC 3 60 

TGATAATGTT CGAACTAAAG ATGAAATCAA TCGTCAAAAA CAAGAACATG TCAAAGATAA 420 

TGAGAAGGTT AACTCTAATG TTGCTGTAGC AAGGTC TC AG GGACGATATA CGACAAATGA 48 0 

TGGTTATGTC TTTAATCCAG CTGATATTAT C GAAGATACG GGTAATGCTT ATATCGTTCC 540 

TCATGGAGGT CACTATCACT ACATTCCCAA AAGCGATTTA TCTGCTAGTG AATTAGCAGC 600 

AGCTAAAGCA CATCTGGCTG GAAAAAATAT GCAAC CGAGT CAGTTAAGCT ATTCTTCAAC 660 

AGCTAGTGAC AATAACACGC AATC TGTAGC AAAAGGATCA ACTAGCAAGC C AGC AAATAA 72 0 

ATCTGAAAAT CTCCAGAGTC TTTTGAAGGA ACTCTATGAT TCACCTAGCG CCCAACGTTA 780 

CAGTGAATCA GATGGCCTGG TCTTTGACCC TGCTAAGATT ATCAGTCGTA C AC C AAATGG 840 

AGTTGCGATT CCGCATGGCG ACCATTACCA CTTTATTCCT TACAGCAAGC TTTCTGCCTT 900 

AGAAGAAAAG ATTGC CAGAA TGGTGCCTAT CAGTGGAACT GGTTCTACAG TTTCTACAAA 960 

TGCAAAACCT AATGAAGTAG TGTCTAGTCT AGGCAGTCTT TCAAGCAATC CTTCTTCTTT 102 0 

"AACGACAAGT AAGGAGCTCT CTTCAGCATC TGATGGTTAT ATTTTTAATC CAAAAGATAT 1080 

CGTTGAAGAA ACGGCTACAG CTTATATTGT AAGACATGGT GATCATTTCC ATTACATTCC .1140 

"AAAATCAAAT CAAATTGGGC AACCGACTCT TCCAAACAAT AGTCTAGCAA CACCTTCTCC 1200 
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ATCTCTTCCA ATCAATCCAG GAACTTCACA TGAGAAACAT GAAGAAGATG GATACGGATT 1260 

TGATGCTAAT CGTATTATCG CTGAAGATGA ATCAGGTTTT GTCATGAGTC AC GGAGACC A 13 2 0 

CAATCATTAT TTCTTCAAGA AG 1342 
(2) INFORMATION FOR SEQ ID NO: 182: , 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(.xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

- Leu Asn Gln-H-is -Arg Ser Gin Glu Asn Lys Asp Asn Asn Arg Val Ser 
1 5 10 15 

Tyr Val Asp Gly Ser Gin Ser Ser Gin Lys Ser Glu Asn Leu Thr Pro 
20 25 30 

Asp Gin Val Ser Gin Lys Glu Gly lie Gin Ala Glu Gin ^Ile Val lie 
35 40 45 

Lys lie Thr Asp Gin Gly Tyr Val Thr Ser His Gly Asp His Tyr ' His 
50 55 k 60 

Tyr Tyr Asn Gly Lys Val Pro Tyr Asp Ala Leu Phe Ser Glu Glu -Leu 
65 70 75 80 

Leu Met Lys Asp Pro Asn Tyr Gin Leu Lys Asp Ala Asp lie Val Asn 
85 90 95 

Glu Val Lys Gly Gly Tyr lie lie Lys Val Asp Gly Lys Tyr Tyr Val 
100 105 110 

Tyr Leu Lys Asp Ala Ala His Ala Asp Asn Val Arg Thr Lys Asp Glu 
115 1-20 125 

lie Asn Arg Gin Lys Gin Glu His Val Lys Asp Asn Glu Lys Val Asn 
130 ' 135 140 

Ser Asn Val Ala Val Ala Arg Ser Gin Gly Arg Tyr Thr Thr Asn Asp 
145 150 155 160 

Gly Tyr Val Phe Asn Pro Ala Asp lie lie Glu Asp Thr Gly Asn Ala 
165 170 175 



Tyr lie Val Pro His Gly Gly 'His Tyr His Tyr lie Pro Lys Ser Asp 
180 185 190 

Leu Ser Ala Ser Glu Leu Ala Ala Ala Lys Ala His . Leu Ala Gly Lys 
195 . 200 205 

Asn Met Gin . Pro Ser Gin Leu Ser Tyr Ser S e r Thr Ala S er Asp Asn 
210' ~ 215 " 220 . 



Asn Thr Gin Ser Val Ala Lys Gly Ser Thr Ser Lys Pro Ala . Asn Lys 
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225 230 235 240 

Ser Glu Asn Leu Gin Ser Leu Leu Lys Glu Leu Tyr Asp Ser Pro Ser 
245 250 255 

Ala Gin Arg Tyr Ser Glu Ser Asp Gly Leu Val Phe Asp Pro Ala Lys 
260 265 270 

lie lie Ser Arg Thr Pro Asn Gly Val Ala lie Pro His Gly Asp His 
275 280 285 

Tyr His Phe lie Pro Tyr Ser Lys Leu Ser Ala Leu Glu Glu Lys lie 
290 295 300- 

Ala Arg Met Val Pro lie Ser Gly Thr Gly Ser Thr Val Ser Thr Asn 
305 r 310 315 320 

Ala Lys Pro Asn Glu Val Val Ser Ser Leu Gly Ser Leu Ser Ser Asn 
325 330 335 

Pro Ser Ser " Leu ; Thr Thr™Ser~ Lys Glu Leu Ser Ser Ala Ser Asp Gly 
340 345 350 

Tyr lie Phe Asn Pro Lys Asp lie Val Glu Glu Thr Ala Thr Ala Tyr 
355 360 365 

lie Val Arg His Gly Asp His Phe His Tyr lie Pro Lys Ser Asn Gin 
370 375 380 

lie Gly Gin Pro Thr Leu Pro Asn Asn Ser Leu Ala Thr Pro Ser Pro 
385 ^ 390 395 400 

Ser Leu Pro lie Asn Pro Gly Thr Ser His Glu Lys His Glu Glu Asp 
405- 410 415 

Gly Tyr Gly Phe Asp Ala Asn Arg lie lie Ala Glu Asp Glu Ser Gly 
420 425 430 

Phe Val Met Ser His Gly Asp His Asn His Tyr Phe Phe Lys Lys 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 934 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 
TG AC T AC C TT GAAATCCCAC TTTACAGCTA TCTTGGTGGA TTCAACACTA AAGTTCTTCC 
AACTCCAATG ATGAACATCA TCAACGGTGG TTCTCACTCT GACGCTCCAA TCGCTTTCCA 
AGAGTTCATG ATCTTGCCAG TTGGTGCGCC AACATTTAAA ' GAAGCCCTTC GTTACGGTGC 
TGAAATC TTC CACGCTCTTA AG AAAATC CT TAAATCACGT GGTTTGGAAA CTGCCGTAGG._ 



TGACGAAGGT GGATTCGCTC CTCGTTTCGA AGGAACTGAA GATGGTGTTG AAACTATCCT 



■ pi 
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TGCTGCGATT GAAGCTGCTG GATATGTACC AGGTAAAGAC GTATTTATCG GATTTGACTG 36 0 

TGCTTCATCA GAATTCTACG ATAAAGAACG TAAAGTTTAC GACTAC AC TA AATTTGAAGG 42 0 

TGAAGGTGCT GCTGTTCGTA CATCTGCAGA ACAAATCGAC_ TACCTTGAAG AATTGGTTAA 48 0 

CAAATAC CC A ATCATCACTA TTGAAGATGG TATGGATGAA AACGACTGGG ATGGTTGGAA 54 0 

AGCTCTTACT GAACGTCTTG GTAAGAAAGT ACAACTTGTT GGTGACGACT TCTTCGTAAC 600 

AAACACTGAC TACCTTGCAC GTGGTATC C A AGAAGGTGCT GCTAACTCAA TCCTTATCAA 66 0 

AGTTAAC C AA ATCGGTACTC TTACTGAAAC TTTTGAAGCT ATCGAAATGG CTAAAGAAGC 720 

TGGTTACACT GCTGTTGTAT CACACCGTTC AGGTGAAACT GAAGATTCAA CAATCGCTGA 780 

TATTGCAGTT GCAACTAACG CAGGACAAAT CAAGACTGGT TCACTTTCAC GT AC AG AC C G . 840 

CATCGCTAAA TACAACCAAT TGCTTCGTAT CGAAGACCAA CTTGGTGAAG TAGCTGAATA 90 0 

TCGTGGATTG AAATCATTCT ACAACCTTAA AAAA" - . .™ — 934 

(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid ) 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:184: 

Asp Tyr Leu Glu lie Pro Leu* Tyr Ser Tyr Leu Gly Gly Phe Asn Thr 
1 5 10 15 

Lys Val Leu Pro Thr Pro Met Met Asn lie lie Asn Gly Gly Ser His 
20 25 30 

Ser Asp Ala Pro lie Ala Phe Gin Glu Phe Met lie Leu Pro Val Gly 
35 40 45 

Ala Pro Thr Phe Lys Glu Ala Leu Arg Tyr Gly Ala Glu lie Phe His 
50 55 60 

Ala Leu Lys Lys lie Leu Lys Ser Arg Gly Leu Glu Thr Ala Val Gly 
65 70 75 80 

Asp Glu Gly Gly Phe Ala Pro Arg Phe Glu Gly Thr Glu Asp Gly Val 
. 85 90 95 

Glu Thr lie Leu Ala Ala lie Glu Ala Ala Gly Tyr Val Pro Gly Lys 
100 - 105 110 

. Asp Val Phe lie Gly Phe Asp Cys Ala Ser Ser Glu Phe Tyr Asp Lys 

115 120 125 , 

Glu Arg Lys Val Tyr Asp Tyr Thr Lys Phe Glu Gly Glu Gly Ala Ala 
130 135 " "140 

Val Arg Thr Ser Ala Glu Gin lie Asp Tyr Leu Glu Glu Leu Val Asn 
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145 150 155 160 

Lys Tyr Pro lie lie Thr lie Glu Asp Gly Met Asp Glu Asn Asp Trp 
165 170 175 

Asp Gly Trp Lys Ala Leu Thr Glu Arg Leu Gly Lys Lys Val Gin Leu 
180 185 190 

Val Gly Asp Asp Phe Phe Val Thr Asn Thr Asp Tyr Leu Ala Arg Gly 
195 200 205. 

lie Gin Glu Gly Ala Ala Asn Ser lie Leu lie Lys Val Asn Gin lie 
210 215 220 

Gly Thr Leu Thr Glu Thr Phe Glu Ala lie Glu Met Ala Lys Glu Ala 

225 230 - 235 • ■ , 240 

Gly Tyr Thr Ala Val Val Ser His Arg Ser Gly Glu Thr Glu Asp Ser 
245 250 255 

Thr lie Ala Asp lie Ala Val Ala Thr Asn Ala Gly Gin lie Lys- Thr- 
260 265 270 

Gly Ser Leu Ser Arg Thr Asp Arg lie Ala Lys Tyr Asn Gin Leu Leu 
275 280 285 

Arg lie Glu Asp Gin Leu Gly Glu Val Ala Glu Tyr Arg Gly Leu Lys 
290 295 300 

Ser Phe Tyr Asn Leu Lys Lys 
305 310 

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

TCGTATCTTT TTTTGGAGCA ATGTTCGCGT AGAAGGACAT TCCATGGATC CGACCCTAGC 6 0 

GGATGGCGAA ATTCTCTTCG TTGTAAAACA CCTTCCTATT GACCGTTTTG ATATCGTGGT. 12 0 

GGCCCATGAG GAAGATGGCA ATAAGGACAT CGTCAAGCGC GTGATTGGAA TGCCTGGCGA 18 0 

CACCATTCGT TACGAAAATG ATAAACTCTA CATCAATGAC AAAGAAACGG ACGAGC CTTA 240 

TCTAGCAGAC TATATCAAAC GCTTCAAGGA TGACAAACTC CAAAGCACTT ACTCAGGCAA -— 3 00 

GGGCTTTGAA GGAAATAAAG GAACTTTCTT TAGAAGTATC GCTCAAAAAG CTCAAGCCTT 3 60 

CACAGTTGAT GTCAACTACA ACACCAACTT TAGCTTTACT GTTCCAGAAG GAGAATACCT . 420 

TCTCCTCGGA GATGACCGCT TGGTTTCGAG CGACAGCCGC CACGTAGGTA CCTTCAAAGC - " — 48 0 

AAAAGATATC ACAGGGGAAG CTAAATTCCG C TTATGGC C A ATCACCCGTA. X.CGGAACATT 540 

X - 541 



290 



(2) INFORMATION FOR 'SEQ ID NO: 186: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

Arg lie Phe Phe Trp Ser Asn Val Arg Val Glu Gly His Ser Met Asp 
1 5 10 15 

Pro Thr Leu Ala Asp Gly Glu lie Leu Phe Val Val Lys His Leu Pro 
20 25 30 

lie Asp Arg Phe Asp lie Val Val Ala His Glu Glu Asp Gly Asn Lys 
35 40 45 

Asp lie Val Lys Arg Val lie Gly Met Pro Gly Asp Thr lie Arg Tyr 
50 55 60 

Glu Asn Asp Lys Leu Tyr lie Asn Asp Lys Glu Thr Asp Glu Pro Tyr 
65 70 75 80 



Leu Ala Asp Tyr lie Lys Arg Phe Lys Asp Asp Lys Leu Gin Ser Thr 
85 90 95 

Tyr Ser Gly Lys Gly Phe Glu Gly Asn Lys Gly Thr Phe Phe Arg Ser 
100 105 110 

lie Ala Gin Lys Ala. Gin Ala Phe Thr Val Asp Val Asn Tyr Asn Thr 
115 120 125 

Asn Phe Ser Phe Thr Val Pro Glu Gly Glu Tyr Leu Leu Leu Gly Asp 
130 135 140 

Asp Arg Leu Val Ser Ser Asp Ser Arg His Val Gly Thr Phe Lys Ala 
145 150 155 160 

Lys Asp lie Thr Gly Glu Ala Lys . Phe Arg Leu Trp Pro lie Thr Arg 
165 170 175 

lie Gly Thr Phe 
180 

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESS: double . 

(D) TOPOLOGY: linear ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
GGACTCTCTC AAAGATGTGA AAGCAAATGC TAGCGACAGC AAGCCTGCAcT AGGACAAGAA ' 
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GGATGCAAAA CAAGGAACGG AAGATAGTAA GGATTCAGAT AAGATGACTG AAACAAACTC 120 

' AGTTCCGGCA GGAGTGATTG TGGTCAGTCT ACTTGCCCTC CTAGGCGTGA TTGCCTTCTG 180 

GCTGATTCGC CGTAAGAAAG AGTCAGAAAT CCAGCAATTA AGCACGGAAT TGATCAAGGT 240 

TCTAGGACAG C TAG ATGC AG AAAAAGCGGA TAAAAAAGTC CTTGCCAAAG CCCAAAACCT 3 00 
TCTCCAAGAA ACCCTTGATT TCGTGAAAGA AGAAAATGGC TCAGCAGAGA CAGAAACTAA - 3 60 

ACTAGTAGAG GAGCTTAAAG CAATCCTTGA CAAAGTCAAG 400 
(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
• (D)' TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 8: 

Ala Asn Ala Ser Asp Ser Lys Pro Ala 
10 15 

Gin Gly Thr Glu Asp Ser Lys Asp Ser 
25 30 

Ser'Val Pro Ala Gly Val lie Val Val 
40 45 

Val lie Ala Phe Trp Leu lie Arg Arg 

60 ' 

Gin Leu Ser Thr Glu Leu lie Lys Val 
75 80 

Lys Ala . Asp Lys Lys Val Leu Ala Lys 
90 ' 95 

Thr Leu Asp Phe Val Lys Glu Glu Asn 
105 110 

Lys Leu Val Glu Glu Leu Lys Ala lie 
120 125 

Leu Asp . Lys Leu Lys 
130 

(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 01 base pairs 

(B) TYPE: nucleic acid 

' (C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



Asp Ser Leu Lys Asp Val Lys 
1 5 

Gin Asp Lys Lys Asp Ala Lys 
20 

Asp Lys Met Thr Glu Thr Asn 
35 

Ser Leu Leu Ala Leu Leu Gly 
50 55 

Lys Lys Glu Ser Glu lie Gin 
65 70 

Leu Gly Gin Leu Asp Ala Glu 
85 

Ala Gin Asn Leu Leu Gin Glu 
100 

Gly Ser Ala Glu Thr Glu Thr 
115 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

CAAGAAATCC TATCATCTCT TCCAGAAGCA AACAGAGACG AGGGGAATTC AGACTCAGTT 60 

GATTGAAGAA TCGCTTAGTC AGCAGACTAT AATCCAGTCC TTCAATGCTC AAACAGAATT-. . 120 

TATCCAAAGA TTGCGTGAGG CTCATGACAA CTACTCAGGC TATTCTCAGT CAGCCATCTT 180 

TTATTCTTCA ACGGTCAATC CTTCGACTCG CTTTGTAAAT GCACTCATTT ATGCCCTTTT 240 

AGC TGGAGTA GGAGCTTATC GTATCATGAT GGGTTC AGC C TTGACCGTCG GTCGTTTAGT 3 00 

GACTTTTTTG AACTATGTTC AGC AATAC AC CAAGCCCTTT AACGATATTT CTTCAGTGCT 3 60 

AGCTGAGTTG CAAAGTGCTC TGGCTTGCGT AGAGC GTATC TATGGAGTCT TAG AT AGC C C 420 

TGAAGTGGCT GAAACAGGTA AGGAAGTCTT GACGACCAGT GACCAAGTTA AGGGAGCTAT 480 

TTCCTTTAAA CATGTCTCTT TTGGCTAC C A TCCTGAAAAA ATTTTGATTA AGGACTTGTC 540 

TATCGATATT CC AGC TGGTA GTAAGGTAGC CATCGTTGGT CCGACAGGTG CTGGAAAATC 600 

AACTCTTATC AATCTCCTTA TGCGTTTTTA TCCCATTAGC TC GGGAGAT A TCTTGCTGGA 660 

TGGGCAATCC ATTTATGATT ATACACGAGT ATCATTGAGA CAGCAGTTTG GTATGGTGCT 720 

TCAAGAAACC TGGCTCACAC AAGGG AC CAT TCATGATAAT ATTGCCTTTG GCAATCCTGA 780 

AGCCAGTCGA GAGCAAGTAA TTGCTGCTGC CAAAGCAGCT AATGCAGACT TTTTCATCCA 840 

ACAGTTGCCA CAGGGATACG ATACCAAGTT GGAAAATGCT GGAGAATCTC TCTCTGTCGG 90.0 
CCAAGCTCAG CTCTTGACCA TAGCCCGAGT CTTTCTGGCT ATTCCAAAGA TTCTTATCTT - 960 

AGACGAGGCA ACTTCTTCCA TTGATACACG GACAGAAGTG CTGGTACAGG ATGCCTTTGC 1020 

AAAACTCATG AAGGGCCGCA CAAGTTTCAT C ATTGC TC AC CGTTTGTCAA C CATTC AGGA 1080 

TGC GGATTT A ATTCTTGTCT TAGTAGATGG TGATATTGTT GAATATGGTA ACCATCAAGA 1140 
ACTCATGGAT AGAAAGGGTA AGTATTACCA AATGCAAAAA GCTGCGGCTT TTAGTTCTGA - 1200 

A 1201 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

Lys Lys Ser Tyr His Leu Phe Gin Lys Gin Thr Glu Thr Arg Gly lie 
1 5 10 15 " 

Gin Thr Gin Leu lie Glu Glu Ser Leu Ser Gin Gin Thr lie lie Gin 
20 25 30 
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Ser Phe Asn Ala Gin Thr Glu Phe lie Gin Arg Leu Arg Glu Ala His 
35 40 45 

Asp Asn Tyr Ser Gly Tyr Ser Gin Ser Ala lie Phe Tyr Ser Ser Thr 
50 55 60 

Val Asn Pro Ser Thr Arg Phe Val Asn Ala Leu lie Tyr Ala Leu Leu 
65 70 75 80 

Ala Gly Val Gly Ala Tyr Arg lie Met Met Gly Ser Ala Leu Thr Val 
85 90 95 

Gly Arg Leu Val Thr Phe Leu Asn Tyr Val Gin Gin Tyr Thr Lys Pro 
100 105 110 

Phe Asn Asp lie Ser Ser Val Leu Ala Glu Leu Gin Ser Ala Leu Ala 
115 120 125 

Cys Val Glu Arg lie Tyr Gly Val Leu Asp Ser Pro Glu Val Ala Glu 
130 ^ 135 140 

Thr Gly Lys Glu Val Leu Thr Thr Ser Asp Gin Val Lys Gly Ala lie 
145 150 155 160 

Ser Phe Lys His Val Ser Phe Gly Tyr His Pro Glu Lys lie Leu lie 
165 170 175 

Lys Asp Leu Ser lie Asp lie Pro Ala Gly Ser Lys Val Ala lie Val 
180 185 190 

Gly Pro Thr Gly Ala Gly Lys Ser Thr Leu lie Asn Leu Leu Met Arg 
195 200 " * 205 

Phe Tyr Pro lie Ser Ser Gly Asp lie Leu Leu Asp Gly Gin Ser lie 
210 215 220 

Tyr Asp Tyr Thr Arg Val Ser Leu Arg Gin Gin Phe Gly Met Val Leu 
225 230 235 240 

Gin Glu Thr Trp Leu Thr Gin Gly Thr lie His Asp Asn lie Ala Phe 
245 250 255 

Gly Asn Pro Glu Ala Ser Arg Glu. Gin Val lie Ala Ala Ala Lys Ala 
260 265 270 

Ala Asn Ala Asp Phe Phe lie Gin Gin Leu Pro Gin Gly Tyr Asp Thr 
275 280 285 

Lys Leu Glu Asn Ala Gly Glu Ser Leu Ser Val Gly Gin Ala Gin Leu 
290 295 300 

Leu Thr lie Ala Arg Val Phe Leu Ala lie Pro Lys lie Leu lie Leu 
305 . 310 315 320 

Asp Glu Ala Thr Ser Ser lie Asp Thr Arg Thr Glu Val Leu-Val Gin 
325 330 335 

Asp Ala Phe Ala Lys Leu Met Lys Gly Arg Thr Ser Phe lie lie Ala 
340 345 350 



His Arg Leu Ser Thr lie Gin Asp Ala Asp Leu lie Leu Val Leu Val 
355 . 360 365 
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Asp Gly Asp lie Val Glu Tyr Gly Asn His Gin Glu Leu Met Asp Arg 
370 375 380 

Lys Gly Lys Tyr Tyr Gin Met Gin Lys Ala Ala Ala Phe Ser Ser Glu 

385 390 395 400 

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 base pairs 

(B) TYPE: nucleic acid 

(C) - STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

AATTGAAAAG GCGGCAGTTA GCCAAGGAGG 60 

AGACGCAGAC TTGCACGAAA TTTATCTAGC 120 

TTTCTCACGT GTTCCCGGGG TGACGGATGC 18 0 

AACAACCAAG TACGAATTGA TTAACCAAAC 240 

TGATGCCAAG CAAATTTCTC TCAAGGAAAT 300 

AACCAGCAAA AATAAACAAG GAAATGATGT 3 60 

CACAGATGAC AAGGATTTGG AAGTGATTAA 420 

CGATCAACCT CTAGCAGTTG AAAAGGAAAA 480 

CCATCAAGAC .TATCTCAAGA AAAATCCAAA 540 

GGCCTATCCT GTCATTGATG CCAGCAAATA 600 

GACCCTGTCA CCTGAGGAGT ATGCAGTTAC 660 

AAACCGTTAC TGGGATAAAT TTGAATCCGG 72 0 

TCTCTTTTCA TCAAAAGACA AATTTGAGTC 780 

CATCAGTCCA GATGTTGTCA CCTACAAGGA . 840 

AGTGCGGAGC C GAGTAGG AG ATTCTCACCT 900 

CAAGGGCGGC TTACGTTACT GTATCAATAG .960 

AATGGAAGAA AAAGGCTACG CTTATTTACT 102 0 
AGATTATGTT GAT .1033 
(2) INFORMATION FOR SEQ ID NO:192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids ~ 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein " 



ACGAAATGCA 
y ■ " AAAAGCAGTG 

tggaggttgt 

iiiji 

im cgtttcaggc 
aggtcatgca 
^ cctgcttcac 
q ggggac c c ag 

fy CCAAGTCTTT 
:j=2 CTTGAAGAAT 
TGGCTACTGC 
TCCAAAACCA 
CCAGGAAAAT 
TATCTATGTG 
TGGTTGTGGC 
AGATAAGTCC 
TGGGCATGTC 
CCTCTCTATC 



GGGCAGACAG ATGCCTCGCA 
AAAAAAACAG AAATTAGTAA 
TTC TGGGG AG TGGAGGAATA 
TATGCAAATG GTAGAGGAGA 
GAAACCGTCC ATGTCACCTA 
TATTTCCGCA TTATCAATCC 
TACCGTACTG GTGTTTATTA 
GATGAGGTGG CTAAGAAATA 
TTTGTGGTGG CTGAGGATTA 
CATATCAATG TTAATCAGGC 
AGTGATGAGG AATTGAAAAA 
CAAACAGAAC GAGCTTTCTC 
GATATAGCAA CTGGGGAACC 
TGGCCTAGTT TTACCCAACC 
TACAATATGA CGC GTATGGA 
TTTAC GGATG GTCCACAGGA 
CGC TTTATTC CCAAAGACCA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 



Arg Asn._Ala Gly Gin Thr Asp Ala Ser Gin lie Glu Lys Ala Ala Val 
1 . 5 10 15 

" Ser Gin Gly Gly Lys Ala Val Lys Lys Thr Glu lie Ser Lys Asp Ala 
20 25 30 

Asp Leu His Glu lie Tyr Leu Ala Gly Gly Cys Phe Trp Gly Val Glu 
. 35 40 45 

Glu Tyr Phe Ser Arg Val. Pro Gly Val Thr Asp Ala Val Ser Gly Tyr 
50 55 .60 

Ala Asn Gly Arg Gly Glu Thr Thr Lys Tyr Glu Leu lie Asn Gin Thr 
65 70 '75 80 

Gly His Ala Glu Thr Val His Val Thr Tyr Asp Ala Lys Gin lie Ser 

■ - - 85 90 95 „ 

Leu Lys Glu lie Leu Leu His Tyr Phe Arg lie lie Asn Pro ' Thr Ser 
100 105 110 

Lys Asn Lys Gin Gly Asn Asp Val Gly Thr Gin Tyr Arg Thr Gly Val 
115 120 125 

Tyr Tyr Thr Asp Asp Lys Asp Leu Glu Val lie Asn Gin Val Phe Asp 
130 135 140 

Glu Val Ala Lys Lys Tyr Asp Gin Pro Leu Ala Val Glu Lys Glu Asn 
145 150 ' 155 160 

Leu Lys Asn Phe Val Val Ala Glu Asp Tyr His Gin Asp Tyr Leu Lys 
165 * 170 175 

Lys Asn Pro Asn Gly Tyr Cys His lie Asn Val Asn Gin Ala Ala Tyr 
180 185 190 

Pro Val lie Asp Ala Ser Lys Tyr Pro Lys Pro Ser Asp Glu Glu Leu 
195 200 205 

- Lys Lys Thr Leu Ser Pro Glu Glu Tyr Ala Val Thr Gin .Glu Asn Gin 
210 215 220 

Thr Glu Arg Ala Phe Ser Asn Arg Tyr Trp Asp Lys Phe Glu Ser Gly 
225 230 235 240 

lie Tyr Val Asp lie- Ala Thr Gly Glu Pro Leu Phe Ser Ser Lys Asp 

. 245 ' 250 255 

Lys Phe Glu Ser Gly Cys Gly Trp Pro Ser Phe Thr Gin Pro lie Ser 
260 265 270 

Pro Asp Val Val Thr Tyr Lys Glu Asp Lys Ser Tyr Asn Met Thr Arg 
275 280 . 285 

Met Glu Val Arg Sef'Arg Val Gly Asp Ser His Leu Gly His Val Phe 
290 1 " 295 30 0 



. Thr Asp Gly Pro Gin Asp Lys Gly Gly Leu Arg Tyr Cys lie Asn Ser 
305 310 315 320 



296 



.. .Leu Ser lie Arg Phe lie Pro Lys Asp Gin Met Glu Glu Lys Gly Tyr 
325 330 335 

Ala Tyr Leu Leu Asp Tyr Val Asp 
340 

(2) INFORMATION FOR~SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 
TGTATAGTTT TTAGCGCTTG TTCTTCTAAT TCTGNTAAAA ATGAAGAAAA TACTTCTAAA 60 



GAGCATGCGC CTGATAAAAT AGTTTTAGAT CATGCTTTCG GTCAAACTAT ATTAGATAAA 120 

AAACCTGAAA GAGTTGCAAC TATTGCTTGG GGAAATCATG ATGTAGCATT AGCTTTAGGA 180 

ATAGTTCCTG TTGGATTTTC . AAAAGCAAAT TACGGTGTAA GTGCTGATAA AGGAGTTTTA 24 0 

CCATGGACAG AAGAAAAAAT CAAAGAACTA AATGGTAAAG CTAACCTATT TGACGATTTG 300 

GATGGACTTA ACTTTGAAGC AATATCAAAT TCTAAACCAG ATGTTATCTT AGCAGGTTAT 3 60 

TCTGGTATAA CTAAAGAAGA TTATGACACT CTATCA 3 96 

(2) INFORMATION FOR SEQ ID NO:194: 

(i) . SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 
. (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ' 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

Cys lie Val Phe Ser Ala Cys Ser Ser Asn Ser Xaa Lys Asn Glu Glu 
1 5 10 15 

Asn Thr Ser Lys Glu His Ala Pro Asp Lys lie Val Leu Asp* His Ala 
20 25 30 

- Phe Gly Gin Thr lie Leu Asp Lys Lys Pro Glu Arg Val Ala Thr. lie 
35 40 .45 

Ala Trp Gly Asn His~Asp Val Ala Leu Ala Leu Gly lie VaT Pro Val 
50 55 60 

Gly Phe Ser Lys Ala Asn Tyr Gly Val" "Ser Ala Asp Lys Gly Val Leu 
65 70 ~ 75 80 



Pro. Trp Thr Glu Glu Lys lie Lys Glu Leu. Asn. Gly Lys Ala Asn Leu 
85 - 90 95 
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Phe Asp Asp Leu Asp Gly Leu Asn Phe Glu Ala lie Ser Asn Ser Lys 
100 105 110 

_ Pro- Asp Val lie Leu Ala Gly Tyr Ser Gly lie Thr Lys Glu Asp Tyr 
115 120 ' 125 

Asp Thr Leu Ser 
130 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:- 195: — 

GTGTGTC GAG CATATTCTGA AGCAAACCTA TCAAAATATA GAAATTATTT TAGTTGATGA 60 

CGGTTCTACG GATAATTCTG GGGAAATTTG TGATGCTTTT ATGATGCAAG ATAATCGTGT 120 

GCGAGTATTG CATCAAGAAA ATAAGGGGGG GGCAGCACAA GCTAAAAATA TGGGGATTAG 180 

TGTAGCTAAG G G AG AGT AC A TCACGATTGT TGATTCAGAT GATATCGTAA AAGAAAATAT 240 

GATTGAAACT CTTTATCAGC AAGTC C AAGA AAAGGATGCA GATGTTGTTA TAGGGAATTA 3 00 

CTATAATTAT GACGAAAGTG AC GGGAATTT TTATTTTTAT GTAACAGGGC AAGATTTTTG 3 60 

CGTCGAAGAA TTAGCTATAC AAGAAATTAT GAACCGTCAA GCAGGAGATT GGAAATTCAA 42 0 

TAGCTCGGCC TTTATATTGC CGACATTTAA GTTGATTAAA AAAGAATTAT ' TCAATGAAGT 48 0 

TCACTTTTCA AATGGTCGCC GCTTTGATGA TGAAGCAACT ATGCATCGCT TTTATC TTTT 540 

AGC CTCTAAA ATCGTCTTTA TAAACGATAA TCTCTATCTG TATAGAAGAC GTTCAGGAAG 600. 

CATCATGAGA AC GGAATTTG ATCTTTCCTG GGCAAGAGAT ATTGTTGAAG TGTTTTCTAA 660 

GAAAATATCG GATTGTGTCT TGGCTGGTTT GGATGTCTCC GTTCTGCGTA TTCGATTTGT 720 

CAATCTTTTA AAAGATTATA AGCAAACTTT AGAATACCAT CAATTAACAG ATACTGAGGA .780 

ATATAAAGAT ATTTGTTTCA GATTAAAGTT GTTTTTTGAT GCAGAACAAA GAAATGGTAA 840 

AAGT 844 



(2) INFORMATION FOR SEQ ID NO: 196 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

. tii) MOLECULE TYPE: protein 



298 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Cys Val Glu His lie Leu Lys Gin Thr Tyr Gin Asn lie Glu lie lie 
1 5 10 _ 15 

Leu Val Asp Asp Gly Ser Thr Asp Asn Ser Gly Glu lie Cys Asp Ala 
20 25 30 

Phe Met Met Gin Asp Asn Arg Val Arg Val Leu His Gin Glu Asn Lys 
35 40 45 

.Gly Gly Ala Ala Gin Ala Lys Asn Met Gly lie Ser Val Ala Lys Gly 
50 - 55 60 

Glu Tyr lie Thr lie Val Asp Ser Asp Asp lie Val Lys Glu Asn Met 
65 70 75 80 

lie Glu Thr Leu Tyr Gin Gin Val Gin Glu Lys Asp Ala Asp Val Val 
85 90 ' 95 

lie Gly Asn Tyr Tyr Asn Tyr Asp Glu Ser Asp Gly Asn Phe Tyr Phe- 
100 105 • 110 

Tyr Val Thr Gly Gin Asp Phe Cys Val Glu Glu Leu Ala lie Gin Glu 
115 120 125 



lie Met Asn Arg Gin Ala Gly Asp Trp Lys Phe Asn Ser Ser Ala Phe 
130 135 140 

lie Leu Pro Thr Phe Lys Leu lie Lys Lys Glu Leu Phe Asn Glu Val 
145 150 . - 155 . 160 

His Phe Ser Asn Gly Arg Arg Phe Asp Asp Glu Ala Thr Met His Arg' 
165 170 . 175 

Phe Tyr Leu Leu Ala Ser Lys lie Val Phe lie Asn Asp Asn Leu Tyr 
180 185 190 

Leu Tyr Arg Arg Arg Ser Gly Ser lie Met Arg Thr Glu Phe Asp Leu 
195 200 205 

Ser Trp Ala Arg Asp lie Val Glu Val Phe Ser Lys Lys lie Ser Asp 
210 '■ 215 220 

Cys Val Leu Ala Gly Leu Asp Val Ser Val Leu Arg lie Arg Phe Val 
225 230 235 240 

Asn Leu Leu Lys Asp Tyr Lys Gin Thr Leu .Glu Tyr His Gin Leu Thr 
245 250 255 

Asp Thr Glu Glu Tyr Lys Asp lie Cys Phe Arg Leu Lys Leu Phe Phe 
260 265 270 

Asp Ala Glu Gin Arg Asn Gly Lys Ser 
275 280 

INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double . _j : 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: ■ 

GTGTTTGGAT AGCATTGAGA ATCAGACGTA TCAAAATTTT GAGTGTTTAT TAATCAATGA 60 

TGGCTCTCCA GATCATTCAT CCAAAATATG TGAAGAATTT GTAGAGAAAG ATTCTCGTTT 12 O 

CAAATATTTT GAGAAAGCAA ACGGCGGTCT TTCATCAGCT CGTAACCTAG GTATTGAATG 180 

TTC GGGGGGG GCGTACATTA CTTTTGTAGA CTCTGATGAT TGGTTGGAAC ATGATGCTTT 240 

AGACCGATTA TATGGTGCTT TGAAAAAGGA AAACGCAGAT ATTAGTATCG GGCGTTATAA 3 00 

TTCTTATGAT GAAACACGCT ATGTGTATAT GACTTATGTT ACGGATCCAG ATGATTCTCT 3 60 

AGAAGTGATA GAAGGTAAAG CAATTATGGA TAGGGAAGGT GTCGAAGAAG TCAGAAATGG 42 0 

GAACTGGACT GTAGCTGTCT TGAAGTTATT CAAGAGAGAG TTACTACAAG ATTTACCATT 48 0 

TCCTATAGGA AAAATTGCAG AGGATACTTA CTGGACATGG AAGGTACTTC TAAGAGC TTC 540 

GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT CTGATACTTT 6 00 

ATCGAATACA TGGAGTGAAA AGC GTATGTA TGATGAAATT GGGGCTAGGG AAGAAAAGAT 660 

AGCTATTTTA GCAAGTTCAG ACTATGACTT G AC C AATC AT ATTTTGATTT ATAAAAATAG 720 
ATTACAAAGA GTGATAGCAA AATTAGAAGA ACAAAATATG CAGTTCACAG AGATTTACAG . 7 80 

AAGAATGATG GAAAAATTGT CTTTACTTCC G 811 



(2). INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY :' linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:198: 

Cys Leu Asp Ser lie Gin Asn Gin Thr Tyr Gin, Asn Phe Glu Cys Leu 

1 5 10 15 

Leu. lie Asn Asp Gly Ser Pro Asp His Ser Ser Lys lie Cys Glu Glu 

20. 25 30 

Phe Val Glu Lys Asp Ser Arg Phe Lys Tyr Phe Glu Lys Ala Asn Gly 

35 40 ,45 

Gly Leu Ser Ser Ala Arg Asn Leu Gly lie Glu Cys Ser Gly Gly Ala 

50 55 60 

Tyr lie Thr Phe Val Asp Ser Asp Asp Trp Leu Glu His Asp Ala' Leu 

65 70 75 80 

Asp Arg Leu Tyr "Gly Ala Leu Lys Lys Glu Asn Ala Asp Ile^Ser lie 

85 ' 90 95 
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Glu Thr Arg Tyr Val Tyr Met Thr Tyr 
105 110 

Leu Glu Val lie Glu Gly Lys Ala lie 
120 125 

Glu Val Arg Asn Gly Asn Trp Thr Val 
140- 

Arg Glu Leu Leu Gin Asp Leu Pro Phe 
155 160 

Asp Thr Tyr Trp Thr Trp Lys Val Leu 
170 175 

Tyr Leu Asn Arg Cys Val Tyr Trp Tyr 
185 190 

Leu Ser Asn Thr Trp Ser Glu. Lys Arg 
200 205 

Arg Glu Glu Lys lie Ala lie Leu Ala 
220 

Asn His lie Leu lie Tyr Lys Asn Arg 

235 240 

Leu Glu Glu Gin Asn Met Gin Phe Thr 
250 .255 

Glu Lys Leu Ser Leu Leu Pro 
265 270 

Q (2) . INFORMATION FOR SEQ ID NO : 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2023 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 199: 
GTGC CTAGAT AGTATTATTA CTCAAACATA TAAAAATATT GAGATTGTTG TCGTTAATGA - "60 

TGGTTCTACG GATGCTTCAG GTGAAATTTG TAAAGAATTT TCAGAAATGG ATCACCGAAT 12 0 

TCTCTATATA GAACAAGAAA ATGCTGGTCT TTCTGCCGCA CGAAACACCG GTCTGAATAA 180 

TATGTCCGGA AATTATGTGA CCTTTGTGGA CTCGGATGAT TGGATTGAGC AAGATTATGT .240 

. AGAAACTCTA TATAAAAAAA TAGTAGAGTA TCAGGCTGAT ATTGCAGTTG GTAATTATTA 300 
TTCTTTCAAC GAAAGTGAAG GAATGTTCTA CTTTCATATA TTGGGAGACT CCTATTATGA . 360 

GAAAGTATAT GATAATGTTT CTATCTTTGA GAACTTGTAT GAAACTCAAG AAATGAAGAG 42 0 

~TTTTGCTTTG ATATCTGCTT GGGGTAAACT CTATAAGGCA agattgtttg agcagttgcg- 480 

CTTTGACATA GGTAAATTAG GAGAAGATGG TTACCTCAAT CAAAAGGTAT ATTTATTATC 540 



Gly Arg Tyr Asn Ser Tyr Asp 
100 

Val Thr Asp Pro Asp Asp Ser 
115 

Met Asp Arg Glu Gly Val Glu 
130 135 

Ala Val Leu Lys Leu Phe Lys 
145 150 

Pro lie Gly Lys lie Ala Glu 
165 

Leu Arg Ala Ser Arg lie Val 
180 

^ Arg Val Gly Leu Ser Asp Thr 

y Met Tyr Asp Glu lie Gly Ala 

& 210 215 

!1j Ser Ser Asp Tyr Asp Leu Thr 

"••3 225 230 

Leu Gin Arg Val lie Ala Lys 

b 245 

Glu lie Tyr Arg Arg Met Met 

iU 260 
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AGAAAAGGTA ATTTATTTAA ATAAAAGTCT TTATGCTTAT CGGATTAGAA AAGGTAGTTT 600 

ATCAAGAGTT TGGACAGAAA AGTGGATGCA CGCTTTAGTT GATGCTATGT CTGAACGTAT 660 

TACGCTACTA GCTAATATGG GTTATCCTCT AGAGAAACAC TTGGCAGTTT ATCGTCAGAT 720 

GTT GGAAGTC AGTGTC GCC A ACGGTCAAGC TAGTGGTTTA TCTGACACAG CAACGTATAA 780 

AGAGTTTGAA ATGAAACAAA GGCTTTTAAA TCAGCTATCG AGACAAGAGG AAAGTGAAAA 840 

GAAAGCCATT GTCCTCGCAG CAAACTATGG CTATGTAGAC CAAGTTTTAA CGACAATCAA 900 

GTCTATTTGT TATCATAATC GTTCGATTCG TTTTTATCTG ATTCATAGCG ATTTTC C AAA 960 

TGAATGGATT AAGCAATTAA ATAAGCGCTT AGAGAAGTTT GACTCAGAAA TTATTAATTG 1020 

TCGGGTAACT TCTGAGCAAA TTTCATGTTA TAAATC GGAT ATTAGTTACA CAGTCTTTTT 1080 

- ACGCTATTTC ATAGCTGATT TCGTGCAAGA AG AC AAGGC C CTCTACTTGG ACTGTGATCT 1140 

AGTTGTAACG AAAAATCTGG ATGACTTGTT TGCTACAGAC TTACAAGATT ATC CTTTGGC 1200 

TGC TGTTAGA GATTTTGGGG GCAGAGCTTA TTTTGGTCAA GAAATCTTTA ATGCCGGTGT 12 60 

TCTCTTGGTA AACAATGCTT TTTGGAAAAA AGAGAATATG ACCCAAAAAT TAATTGATGT 1320 

AACCAATGAA TGGCATGATA AGGTGGATCA GGCAGATCAG AGCATCTTGA ATATGCTTTT 13 80 

TGAACATAAA TGGTTGGAAT TGGACTTTGA TTATAATCAT ATTGTCATTC ATAAACAGTT 1440 

TGCTGATTAT CAATTGCCTG AGGGTCAGGA TTATCCTGCT ATTATTCACT ATCTTTCTCA 1500- 

TCGGAAACCG TGGAAAGATT TGGCGGCCCA AACCTATCGT GAAGTTTGGT GGTACTATCA 1560 

TGGGCTTGAA TGGACAGAAT TGGGACAAAA CCATCATTTA CATC C ATT AC AAAGATCTCA 1620 

CATCTATCCA ATAAAGGAAC CTTTCACTTG TCTAATCTAT ACTGCCTCAG ACCATATTGA 1680 

ACAAATTGAG ACATTGGTTC AATCCTTGCC TGATATTCAG TTTAAGATAG CAGCTAGAGT 1740 

AATAGTTAGT GATCGATTGG CTCAGATGAC AATTTATCCA AAC GTGACTA TATTTAACGG 1800 

AATTCACTAT TTGGTAGATG TCGATAATGA ATTGGTAGAA ACCAGTCAAG TACTTTTAGA 1860 

TATTAATCAT GGC GAAAAGA CAGAAGAAAT TCTCGATCAA TTTGCTAATC TTGGCAAGCC 1920. 

TATCTTATCC TTTGAAAATA CTAAAACCTA TGAAGTAGGT CAGGAGGCAT ATGCTGTTGA 1980 

CCAAGTTCAA GCAATGATTG AAAAATTGAG AGAAATAAGC AAA ■ .2023 



(2) INFORMATION FOR SEQ . ID NO:200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 674 amino acids 

(B) TYPE: amino acid 

( C ) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

i<ii)_MOLECULE jTYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 
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Cys Leu Asp Ser lie lie Thr Gin Thr Tyr Lys Asn lie Glu lie Val 
1 5 10 15 

Val Val Asn Asp Gly Ser Thr Asp Ala Ser Gly Glu lie Cys Lys Glu 
20 25 . 30 

Phe Ser Glu Met Asp His Arg lie Leu Tyr lie Glu Gin Glu Asn Ala 
35 40 45 

Gly Leu Ser Ala Ala Arg Asn Thr Gly Leu Asn Asn Met Ser Gly Asn 
50 .55 60 

Tyr Val Thr Phe Val Asp Ser Asp Asp Trp lie Glu Gin Asp Tyr Val 
65 70 75 80 

Glu Thr Leu Tyr Lys Lys lie Val Glu Tyr Gin Ala Asp lie Ala Val 
85 .90 95 

Gly Asn Tyr Tyr Ser Phe Asn Glu Ser Glu Gly Met Phe Tyr Phe His 
100 , 105 110 

lie Leu Gly Asp Ser Tyr Tyr Glu Lys Val Tyr Asp Asn Val Ser lie 
115 120 125 

Phe Glu Asn Leu Tyr Glu Thr Gin Glu Met Lys Ser Phe Ala Leu lie 
130 135 140 

Ser Ala Trp Gly Lys Leu Tyr Lys Ala Arg Leu Phe Glu Gin Leu Arg 
145 150 155 160 

Phe Asp lie Gly Lys Leu Gly Glu Asp Gly Tyr Leu Asn Gin Lys Val 
165 170 175 

Tyr Leu Leu Ser Glu Lys Val lie Tyr Leu Asn Lys Ser Leu Tyr Ala 
180 185 190 

Tyr Arg lie Arg Lys Gly Ser Leu Ser Arg Val Trp Thr Glu Lys Trp 
195 200 205 

Met His Ala Leu Val Asp Ala Met Ser Glu Arg lie Thr Leu Leu Ala 
210 215 220 

Asn Met Gly Tyr Pro Leu Glu Lys His Leu Ala Val Tyr Arg Gin Met 
225 230 235 240 

Leu Glu Val Ser Leu Ala Asn Gly Gin Ala Ser Gly Leu Ser Asp Thr 
245 250 255 

Ala Thr Tyr Lys Glu Phe Glu Met Lys Gin Arg Leu Leu Asn Gin Leu 
260 . 265 270 

Ser Arg Gin Glu' Glu Ser Glu Lys Lys Ala lie Val Leu Ala Ala Asn 
275 280 285 

Tyr Gly Tyr Val Asp Gin- Val Leu Thr Thr lie Lys Ser lie Cys Tyr 
290 295 .300 

His Asn Arg Ser lie Arg Phe Tyr Leu lie His . Ser Asp Phe Pro Asn 
305 1 310 315 . 320 



Glu Trp lie Lys Gin Leu Asn Lys Arg Leu Glu Lys Phe Asp Ser Glu 
325 330 335 
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lie Ile„Asn Cys Arg Val Thr Ser Glu Gin lie Ser Cys Tyr Lys Ser 
340 . 345 350 

Asp lie Ser Tyr Thr Val Phe Leu Arg Tyr Phe lie Ala Asp Phe Val 
355 360 365 

Gin Glu Asp Lys Ala Leu Tyr Leu Asp Cys Asp Leu Val Val Thr Lys 
370 375 380 

Asn Leu Asp Asp Leu Phe Ala Thr Asp Leu Gin Asp Tyr Pro Leu Ala 
385 390 ' 395 400 

Ala Val Arg Asp Phe Gly Gly Arg Ala Tyr Phe Gly Gin Glu lie Phe 
405 410 415 

Asn Ala Gly Val Leu Leu Val Asn Asn Ala Phe Trp Lys Lys Glu Asn 
420 425 430 

Met Thr Gin Lys Leu lie Asp Val Thr Asn Glu Trp His Asp Lys Val 
435 440 445 

Asp Gin Ala Asp Gin Ser lie Leu Asn Met Leu Phe Glu His Lys Trp 
450 455 460 

Leu Glu Leu Asp Phe Asp Tyr Asn His lie Val lie His Lys Gin Phe 
465 470 475 ' 480 

Ala Asp Tyr Gin Leu Pro Glu Gly Gin Asp Tyr Pro Ala lie lie His 
485 490 495 

Tyr Leu Ser His Arg Lys Pro Trp Lys Asp Leu Ala Ala Gin Thr Tyr 
500 505 • \ 510 

Arg Glu Val Trp Trp Tyr Tyr His Gly Leu Glu Trp Thr Glu Leu Gly 
515 520 525 

Gin Asn His His Leu .His Pro Leu Gin Arg Ser His lie Tyr Pro lie 
530 ... 535 540 

' Lys Glu Pro Phe Thr Cys Leu lie Tyr Thr Ala Ser Asp His lie Glu 
545 - , 550 555 560 

Gin lie Glu Thr Leu Val Gin Ser Leu Pro Asp lie Gin Phe Lys lie 
565 570 575 

■Ala Ala Arg Val lie Val Ser Asp Arg Leu Ala Gin Met Thr lie Tyr 
580 . 585 590 

Pro Asn Val Thr lie Phe Asn Gly lie His Tyr Leu Val Asp Val Asp 
595 600 605 

Asn Glu Leu Val Glu Thr Ser Gin Val Leu Leu Asp lie Asn His Gly 
610 * 615 620 

Glu Lys Thr Glu Glu lie Leu Asp Gin Phe Ala Asn Leu Gly Lys Pro 
625 630 635 640 

lie Leu" Ser Phe Glu Asn Thr Lys Thr Tyr Glu Val Gly Gin Glu Ala 
645 _650 , 655 

Tyr Ala Val Asp Gin Val Gin Ala Met lie Glu Lys Leu Arg Glu lie 
660 665 670 
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Ser Lys 

(2) INFORMATION FOR.JSEQ ID NO : 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 201: 

CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT GTTGATGATG GTGCAACAGA 60 

TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT GACAGGGTGT CAGTGCTTCA 120 

TAAAAAGAAC GAAGGATTGT - CGCAAGCACG - AAATGATGGG ATGAAGCAGG CTCACGGGGA 180 

TTATCTGATT TTTATTGACT CAGATGATTA TATCCATCCA GAAATGATTC AGAGCTTATA 240 

TGAGCAATTA GTTCAAGAAG ATGCGGATGT TTCGAGCTGT GGTGTCATGA ATGTCTATGC 3 00 

TAATGATGAA AGCCCACAGT CAGCCAATCA GGATGAC TAT TTTGTCTGTG ATTCTCAAAC 3 60 

ATTTCTAAAG GAATACCTCA TAGGTGAAAA AATACCTGGG ACGATTTGCA ATAAGC TAAT 420 

CAAGAGACAG ATTGCAACTG CCCTATCCTT TCCTAAGGGG TTGATTTACG AAGATGCCTA 480 

TTACCATTTT GATTTAATCA AGTTGGCCAA GAAGTATGTG GTTAATACTA AACCCTATTA 540 

TTACTATTTC CATAGAGGGG , ATAGTATTAC GACCAAACCC TATGCAGAGA AGGATTTAGC 600 

CTATATTGAT ATCTACCAAA AGTTTTATAA TGAAGTTGTG AAAAACTATC CTGACTTGAA 660 

AGAGGTCGCT TTTTTCAGAT TGGCCTATGC CCACTTCTTT ATTCTGGATA AGATGTTGCT -'- 7 720 

AGATGATCAG TATAAACAGT TTGAAGCCTA TTCTCAGATT CATC GTTTTT TAAAAGGCCA 780 

TGCCTTTGCT ATTTCTAGGA ATCCAATTTT CCGTAAGGGG AGAAGAATTA GTGCTTTGGC 840 

CCTATTCATA AATATTTCCT TATATCGATT . CTTATTACTG AAAAATATTG AAAAATCTAA 900 
AAAATTACAT : . 910 
(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 03 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:. .single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: protein 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 202: 



lie Gin Lys Gin Thr Tyr Gin Asn Leu Glu lie lie Leu Val Asp Asp 
1 ' 5 10 15 
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Gly Ala Thr Asp Glu Ser Gly Arg Leu Cys Asp Ser lie Ala Glu Gin 
20 .25 30 



Asp Asp Arg Val Ser Val Leu His_Lys Lys Asn Glu Gly Leu Ser Gin 
35 '40 45 

Ala Arg Asn Asp Gly^Met Lys Gin Ala His Gly Asp Tyr Leu lie Phe 
50 55 60 

lie Asp Ser Asp Asp Tyr lie' His Pro Glu Met lie Gin Ser Leu Tyr 
65 .70 75 80 

Glu Gin Leu Val Gin Glu Asp Ala Asp Val Ser Ser Cys Gly Val Met 

"85 90 ' . 95 

Asn Val Tyr Ala Asn Asp Glu Ser Pro Gin Ser Ala Asn Gin Asp Asp 
100 105- 110 

Tyr Phe Val Cys Asp Ser Gin Thr Phe Leu Lys Glu Tyr Leu lie Gly 

115 - - - 120 - : ; -. 125 

Glu Lys lie Pro Gly Thr lie Cys Asn Lys Leu lie Lys Arg Gin lie 
130 135 140 

Ala Thr Ala Leu Ser Phe Pro Lys Gly Leu lie Tyr Glu Asp Ala Tyr 
145 . 150 155 160 

Tyr His Phe Asp Leu lie Lys Leu Ala Lys Lys Tyr Val Val Asn Thr 
165 170 175 

Lys Pro Tyr Tyr Tyr Tyr Phe His Arg Gly Asp Ser lie Thr Thr Lys 
180 185 190 

Pro Tyr Ala Glu Lys Asp Leu Ala Tyr lie Asp lie Tyr Gin Lys Phe 
195 200 205 

Tyr Asn Glu Val Val Lys Asn Tyr Pro Asp Leu Lys 'Glu Val Ala Phe 
210 215 - 220 

Phe Arg Leu Ala Tyr Ala His Phe Phe . lie Leu Asp Lys Met Leu Leu 
225 230 235 240 

Asp Asp Gin Tyr Lys Gin Phe Glu Ala Tyr Ser Gin lie His Arg Phe 
. 245 250 255 

Leu Lys Gly His Ala Phe Ala lie Ser Arg Asn Pro lie Phe Arg Lys 
260 265 270 

Gly Arg Arg lie Ser Ala Leu Ala Leu Phe lie Asn lie Ser Leu Tyr 
275 280 285 

Arg Phe Leu Leu Leu Lys Asn lie Glu Lys Ser Lys Lys Leu His 
290 295 300 . 

(2) INFORMATION FOR SEQ ID NO f 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1972 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double ~ 

.-<D)_TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

TAAGGCTGAT AATCGTGTTC AAATGAGAAC GACGATTAAT AATGAATCGC. CATTGTTGCT 60 

TTCTCCGTTG TATGGCAATG ATAATGGTAA C GG ATTATGG TGGGGGAACA CATTGAAGGG 12 0 

AGCATGGGAA GCTATTCCTG AAGATGTAAA GCCATATGCA GCGATTGAAC TTCATCCTGC 180 

AAAAGTCTGT AAACCAACAA GTTGTATTCC ACGAGATACG AAAGAATTGA GAGAATGGTA 240 

TGTCAAGATG TTGGAGGAAG CTCAAAGTCT AAACATTCCA GTTTTCTTGG TTATTATGTC . 3 00 

GGCTGGAGAG CGTAATACAG TTCCTCCAGA GTGGTTAGAT GAACAATTCC AAAAGTATAG 3 60 

TGTGTTAAAA GGTGTTTTAA ATATTGAGAA TTATTGGATT TACAATAACC AGTTAGCTCC 420 

GCATAGTGCT AAATATTTGG AAGTTTGTGC CAAATATGGA GCGCATTTTA TCTGGCATGA 48 0 

TCATGAAAAA TGGTTCTGGG AAACTATTAT GAATGATCCG" ACATTCTTTG AAGCGAGTCA 54 0 

AAAATATCAT AAAAATTTGG TGTTGGCAAC TAAAAATACG CCAATAAGAG ATGATGCGGG 600 

TACAGATTCT ATCGTTAGTG GATTTTGGTT GAGTGGCTTA TGTGATAACT GGGGCTCATC 660 

AACAGATACA TGGAAATGGT GGGAAAAACA TTATACAAAC ACATTTGAAA CTGGAAGAGC 720 

TAGGGATATG AGATC CTATG CATCGGAACC AGAATCAATG ATTGCTATGG AAATGATGAA - 780 

TGTATATACT GGGGGAGGCA CAGTTTATAA TTTCGAATGT GCCGCGTATA CATTTATGAC 840 

AAATGATGTA CCAACTCCAG CATTTACTAA AGGTATTATT CCTTTCTTTA GACATGCTAT 900 

Q ACAAAATCCA GCTCCAAGTA AGGAAGAAGT TGTAAATAGA ACAAAAGCTG TATTTTGGAA 960 

TGGAGAAGGT AGGATTAGTT CATTAAACGG ATTTTATCAA GGACTTTATT C GAATG ATG A 1020 

_ AACAATGCCT TTATATAATA ATGGGAGATA TCATATTCTT CCTGTAATAC ATGAGAAAAT . 1080 

TGATAAGGAA AAGATTTCAT CTATATTCCC TAATGCAAAA ATTTTGACTA AAAATAGTGA 1140 

GGAATTGTCT AGTAAAGTCA ACTATTTAAA CTCGCTTTAT CCAAAACTTT ATGAAGGAGA ' 12 00 

TGGGTATGCT CAGCGTGTAG GTAATTCCTG GTATATTTAT AATAGTAATG CTAATATCAA 1260 

TAAAAATCAG CAAGTAATGT TGC CTATGTA TACTAATAAT AC AAAGTC GT TATCGTTAGA . 1320 

TTTGACGCCA CATAC TTACG CTGTTGTTAA AGAAAATCCA AATAATTTAC ATATTTTATT 1380 
GAATAATTAC AGGACAGATA AGACAGCTAT GTGGGCATTA TCAGGAAATT . TTGATGCATC ' 1440 

r AAAAAGTTGG AAGAAAGAAG AATTAGAGTT AGCGAACTGG ATAAGCAAAA ATTATTCCAT 1500 

■. CAATCCTGTA GATAATGACT TTAGGACAAC AACACTTACA TTAAAAGGGC ATACTGGTCA 1560 

. TAAACCTCAG ATAAATATAA GTGGCGATAA AAATCATTAT ACTTATACAG AAAATTGGGA 162 0 

TGAGAAT AC C CATGTTTATA CCATTACGGT TAATCATAAT GGAATGGTAG AGATGTCTAT 168 0 

AAATACTGAG GGGACAGGTC CAGTCTCTTT CCCAACACCA jGATAAATTTA ATGATGGTAA 1740 

TTTGAATATA GCATATGCAA AACCAACAAC ACAAAGTTCT GTAGATTACA ATGG AG AC C C 18 00 



'I 

2=1 3 



# • 
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TAATAGAGCT GTGGATGGTA ACAGAAATGG TAATTTTAAC TCTGGTTCGG TAACACACAC I860 

TAGGGCAGAT AATCCCTCTT GGTGGGAAGT CGATTTGAAA AAAATGGATA AAGTTGGGCT 192 0 

TGTTAAAATT TATAATCGCA CAGATGCTGA GACTCAACGT CTATCTAATT TT 1972 
(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 04: 

Lys Ala Asp Asn Arg Val Gin Met Arg Thr Thr lie Asn Asn Glu Ser 
15 10 15 

Pro Leu Leu Leu Ser Pro Leu Tyr Gly Asn Asp Asn Gly Asn Gly Leu 
20 25 30 

Trp Trp Gly Asn Thr Leu Lys Gly Ala Trp Glu Ala lie Pro Glu Asp 
35 40 45 

Val Lys Pro Tyr Ala Ala lie Glu Leu His Pro Ala Lys Val Cys Lys 
50 55 60 

. Pro Thr Ser Cys lie Pro Arg Asp Thr Lys Glu Leu Arg Glu Trp Tyr 
65 70 " - 75 80 

Val Lys Met Leu Glu Glu Ala Gin Ser Leu Asn lie Pro Val .Phe Leu 
85 90 . 95 

Val lie Met Ser Ala Gly Glu Arg Asn Thr Val' Pro Pro Glu Trp Leu 
100 105 110 

Asp Glu Gin Phe Gin Lys Tyr Ser Val Leu Lys Gly Val Leu Asn lie 
115 120 125 

Glu Asn Tyr Trp lie Tyr Asn Asn Gin Leu Ala Pro His Ser Ala Lys 
130 135 140 

Tyr Leu Glu Val Cys Ala Lys Tyr Gly Ala His Phe lie Trp His Asp 
145 150 155 160 

His. Glu Lys Trp Phe Trp Glu Thr lie Met Asn Asp Pro Thr Phe Phe. 

165 170 175 

Glu Ala Ser Gin Lys Tyr His Lys Asn Leu Val Leu Ala Thr Lys Asn 
180 185 - 190 

Thr Pro lie Arg Asp Asp Ala Gly Thr Asp Ser lie Val Ser Gly Phe 
195 200 205 

Trp Leu Ser Gly Leu Cys Asp Asn Trp Gly Ser Ser Thr Asp Thr_ Trp 
210 215 220 



Lys Trp Trp Glu Lys His Tyr Thr Asn Thr Phe Glu Thr Gly Arg Ala 



# 
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225 230 235 240 

Arg Asp Met Arg Ser Tyr Ala Ser Glu Pro Glu Ser Met lie Ala Met 
245 250 . 255 

Glu Met Met Asn Val Tyr Thr* Gly Gly Gly Thr Val Tyr ' Asn Phe Glu 
260 265 270 

Cys Ala Ala Tyr Thr Phe Met Thr Asn Asp Val Pro Thr Pro Ala Phe 
275 ' 280 285 

Thr Lys Gly lie lie Pro Phe Phe Arg His Ala' lie Gin Asn Pro Ala 
290 295 300 

Pro Ser' Lys Glu Glu Val Val Asn Arg Thr Lys Ala Val Phe Trp Asn' 
305 310 315 . . 320 

Gly Glu Gly Arg lie Ser Ser Leu Asn Gly Phe Tyr Gin Gly Leu Tyr 
325 ' 330 335 

Ser Asn Asp Glu Thr Met Pro Leu Tyr Asn Asn Gly Arg Tyr His lie 
340 345 350 

Leu Pro Val lie His Glu Lys lie Asp Lys Glu Lys lie Ser Ser lie 
. 355 360 365 

Phe Pro Asn Ala Lys lie Leu Thr Lys Asn Ser Glu Glu Leu Ser Ser 
370 375 380 

Lys Val Asn Tyr Leu Asn Ser Leu Tyr Pro Lys Leu Tyr Glu Gly Asp 
385 390 395 400 

Gly Tyr Ala Gin Arg Val Gly Asn Ser Trp Tyr lie Tyr Asn Ser Asn 
* 405 . 410 415 

Ala Asn lie Asn Lys" Asn Gin Gin Val Met Leu Pro Met Tyr Thr Asn 
420 425 430 

Asn Thr Lys Ser Leu Ser Leu Asp Leu Thr Pro His Thr Tyr Ala Val 
435 440 445 

Val Lys Glu Asn Pro Asn Asn Leu His lie Leu Leu Asn Asn Tyr Arg 
450 455 . 460 

Thr Asp Lys Thr Ala Met Trp Ala Leu Ser Gly Asn Phe Asp Ala Ser 
465 470 475 480 

Lys Ser Trp Lys Lys Glu Glu Leu Glu Leu Ala Asn Trp lie Ser Lys 
485 490 495 

Asn Tyr Ser lie Asn Pro Val Asp Asn Asp Phe Arg Thr Thr Thr Leu 
500 • 505 510 

Thr Leu Lys Gly His Thr Gly His Lys Pro Gin lie Asn lie Ser Gly 
515 . 520 . N 525 

*•* 

Asp Lys Asn His Tyr Thr Tyr Thr Glu Asn Trp Asp Glu Asn Thr His 
530 535 540 

Val Tyr Thr lie Thr Val Asn His Asn Gly Met Val Glu Met Ser lie 
545 ■ 550 555 ... 560 



Asn Thr Glu Gly Thr Gly Pro Val Ser Phe Pro Thr Pro Asp Lys Phe 
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565 570 575 

Asn Asp Gly Asn Leu Asn lie Ala Tyr Ala Lys Pro Thr Thr Gin Ser 
580 585 590 

Ser Val Asp Tyr Asn Gly Asp Pro Asn Arg Ala Val Asp Gly Asn Arg 
595 600 605 

Asn Gly Asn Phe Asn Ser Gly Ser Val Thr His Thr Arg Ala Asp Asn 
610 615 620 

q ^ 

Pro Ser Trp Trp Glu Val Asp Leu Lys Lys Met Asp Lys Val Gly Leu 
625 630 635 640 

Val Lys lie Tyr Asn Arg Thr Asp Ala Glu Thr Gin Arg Leu Ser Asn 
645 650 655 

Phe 



(2) INFORMATION FOR SEQ ID NO : 205: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

CTGTGGCAAT CAGTCAGCTG CTTCCAAACA GTCAGCTTCA GGAACGATTG AGGTGATTTC 60 

ACGAGAAAAT GGCTCTGGGA C AC GGGGTGC CTTCACAGAA ATCACAGGGA TTCTCAAAAA 120 
AGACGGTGAT AAAAAAATTG ACAACACTGC CAAAACAGCT GTGATTCAAA ATAGTACAGA - 18 0 

AGGTGTTCTC TCAGCAGTTC AAGGGAATGC TAATGCTATC GGCTACATCT C CTTGGG ATC 240 

TTTAACGAAA TCTGTCAAGG CTTTAGAGAT TGATGGTGTC AAGGCTAGTC GAGACACAGT 3 00 

TTTAGATGGT GAATACCCTC TTCAACGTCC CTTCAACATT GTTTGGTCTT CTAATCTTTC 3 60 

CAAGCTAGGT CAAGATTTTA TCAGCTTTAT CCACTCCAAA CAAGGTCAAC AAGTGGTCAC 420 

AGATAATAAA TTTATTGAAG CTAAAACCGA AACCAC GGAA TATACAAGCC AACACTTATC 480 

AGGCAAGTTG TCTGTTGTAG GTTCCACTTC AGTATCTTCT TTAATGGAAA AATTAGCAGA 540 

AGCTTATAAA AAAGAAAATC CAGAAGTTAC GATTGATATT ACCTCTAATG GGTCTTCAGC 600 

AGGTATTACC GCTGTTAAGG AGAAAACCGC TGATATTGGT ATGGTTTCTA GGGAATTAAC 660 

TCCTGAAGAA GGTAAGAGTC TCACCCATGA TGCTATTGCT TTAGACGGTA TTGCTGTTGT 720 

GGTCAATAAT GACAATAAGG CAAGCCAAGT CAGTATGGCT GAACTTGCAG ACGTTTTTAG 780 

TGGCAAATTA ACCACCTGGG ACAAGATTAA A 811 

(2) INFORMATION FOR SEQ . ID NO: 2 06: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

CO STRANDEDNESS : single 

• (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 

Cys Gly Asn Gin Ser Ala Ala Ser Lys Gin Ser Ala Ser Gly Thr lie 
1 5 15 

Glu Val lie Ser Arg Glu Asn Gly Ser Gly Thr Arg Gly Ala Phe Thr 
20 25 30 

Glu lie Thr Gly lie Leu Lys Lys Asp Gly Asp Lys Lys lie Asp Asn 
35 40 45 

Thr Ala Lys Thr Ala Val lie Gin Asn Ser Thr Glu Gly Val Leu Ser 

50 -55 60 

Ala Val Gin Gly Asn Ala Asn Ala lie Gly Tyr lie Ser Leu Gly Ser 
65 70 75 80 

Leu Thr Lys Ser Val Lys Ala Leu Glu lie Asp Gly Val Lys Ala Ser 
85 90 95 

Arg Asp Thr Val Leu Asp Gly Glu Tyr Pro Leu Gin Arg Pro Phe Asn 
100 105 110 

lie Val Trp Ser Ser Asn Leu Ser Lys Leu Gly Gin Asp Phe lie Ser 
115 120 125 

Phe lie His Ser Lys Gin Gly Gin Gin Val Val Thr Asp Asn Lys Phe 
130 135 140 

. lie Glu Ala Lys Thr Glu Thr Thr Glu Tyr Thr Ser Gin His Leu Ser 
145 150 , 155 160 

Gly . Lys Leu Ser Val Val Gly Ser Thr Ser Val Ser Ser Leu Met Glu 
165 170 ■ 175 

Lys Leu Ala Glu Ala Tyr Lys Lys Glu Asn Pro Glu Val Thr lie Asp 
180 185 190 

lie Thr Ser Asn Gly Ser Ser Ala Gly lie Thr Ala Val Lys Glu Lys 
195 200 205 

Thr Ala Asp lie Gly Met Val Ser Arg Glu Leu Thr Pro Glu Glu Gly 
210 215 220 

Lys Ser Leu Thr His Asp Ala lie Ala Leu Asp Gly lie Ala Val Val 
225 230 235 240 

Val Asn Asn Asp Asn Lys Ala Ser Gin Val Ser Met Ala Glu Leu Ala 
245 250 255 

Asp Val Phe Ser Gly Lys Leu Thr Thr Trp Asp Lys lie Lys 
260 265 270 



(2) INFORMATION FOR SEQ ID NO: 207: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



. <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

TTGTCAACAA CAACATGCTA CTTCTGAGGG GACGAATCAA AGGCAAAGCA GTTCAGCGAA '60 

AGTTCCATGG AAAGCTTCAT ACACCAACCT AAACAACCAG GTAAGTACAG AAGAGGTCAA 12 0 

ATCTCTCTTA TCAGCTCACT TGGATCCAAA TAGTGTTGAT GCATTTTTTA ATCTCGTTAA 18 0 

TGACTATAAT ACCATTGTCG GCTCAACTGG CTTATCAGGA GATTTC AC TT CCTTTACTCA 240 

C AC CGAATAC GATGTTGAGA AAATCAGTCA TCTCTGGAAT CAAAAGAAGG GCGATTTTGT 3 00 

"TGGGACCAAC TGCCGTATCA ATAGTTATTG' TCTTTTGAAA AATTCAGTCA CCATTCCAAA 3 60 

GCTTGAAAAG AATGACCAGT TGCTTTTCCT AGATAATGAT GCGATTGATA AAGGAAAGGT 420 

CTTTGATTCA CAAGATAAGG AAGAGTTTGA TATTCTATTT TCGAGAGTTC CAACTGAGTC 480 

AACTACAGAT GTCAAGGTTC ACGCTGAAAA GATGGAAGCA TTCTTCTCAC AATTTCAATT 540 

CAATGAAAAA GCTCGAATGC TGTCTGTAGT CTTGCACGAC AATTTGGATG GC GAGTATCT 600 

GTTTGTAGGC CACGTTGGGG TCTTAGTACC TGCTGATGAC GGTTTCTTAT TTGTAGAGAA 660 

ATTGACTTTC GAAGAGCCCT ACCAAGCGAT TAAATTTGCT AGTAAGGAAG ATTGCTACAA 720 

GTATTTGGGC ACCAAGTATG CGGATTATAC AGGCGAGGGA CTGGCTAAGC C TTTTATC AT 78 0 

GGATAATGAT AAGTGGGTTA AACTT 805 
(2) INFORMATION FOR SEQ ID NO: 2 08: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 

Cys Gin Gin Gin His Ala Thr Ser Glu Gly Thr Asn Gin Arg Gin Ser 
1 5 .10 15 

Ser Ser Ala Lys Val Pro Trp Lys ^Ala Ser Tyr Thr Asn Leu Asn Asn 
20 25 30 

Gin Val Ser Thr Glu Glu Val Lys Ser Leu Leu Ser Ala His Leu Asp 

35 40 .45 



e . Pro Asn Ser Val Asp Ala Phe Phe Asn Leu Val Asn Asp Tyr Asn Thr 
50 55 60 
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lie Val Gly Ser Thr Gly Leu Ser Gly Asp Phe Thr Ser Phe Thr His 

65 . 70 75 80 

Thr. Glu Tyr Asp Val Glu Lys lie Ser His Leu Trp Asn Gin Lys Lys 
85 90 . 95 

Gly Asp Phe Val Gly Thr Asn Cys Arg lie Asn Ser Tyr Cys Leu Leu 

100 " 105 110 

Lys Asn Ser Val Thr lie Pro Lys Leu Glu Lys Asn Asp Gin Leu Leu 

' 115 120 125 

Phe Leu Asp Asn Asp "Ala lie "Asp Lys Gly Lys Val Phe Asp Ser Gin 

130 135 ..140 

Asp Lys Glu Glu Phe Asp lie Leu Phe Ser Arg Val Pro Thr Glu Ser 

145 150 155 160 

Thr Thr Asp Val Lys Val His Ala Glu Lys Met Glu Ala Phe Phe Ser 

165 ■ - 170 175 



Gin Phe" Gin Phe Asn Glu Lys Ala Arg Met Leu Ser Val Val Leu His 
' 180 185 190 

Asp Asn Leu Asp Gly Glu Tyr Leu Phe Val Gly His Val Gly Val Leu 
195 200 205 

Val Pro Ala Asp Asp Gly Phe Leu Phe Val Glu Lys Leu Thr Phe Glu 
210 215 220 

Glu Pro Tyr Gin Ala lie Lys Phe Ala Ser Lys Glu Asp Cys Tyr Lys 
225 230 235 , 240 

Tyr Leu Gly Thr Lys Tyr Ala Asp Tyr Thr Gly Glu Gly Leu Ala Lys 
245 250 .255 

Pro Phe lie Met Asp Asn Asp Lys Trp Val Lys Leu 
260 265 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) . TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 
TTGTTCAGGC AAGTCCGTGA CTAGTGAACA C C AAAC G AAA GATGAAATGA AGACGGAGCA 
GACAGCTAGT AAAACAAGCG CAGCTAAAGG GAAAGAGGTG GCTGATTTTG AATTGATGGG 
AGTAGATGGC AAGAC CTACC GTTTATCTGA TTACAAGGGC AAGAAAGTCT ATCTCAAATT 
CTGGGCTTCT TGGTGTTCCA TCTGTCTGGC TAGTCTTCCA GATACGGATG AGATTGCTAA 
AGAAGCTGGT GATGACTATG TGGTCTTGAC AGTAGTGTCA CCAGGACATA AGGGAGAGCA 



ATCTGAAGCG GACTTTAAGA ATTGGTATAA GGGATTGGAT TATAAAAATC TCCCAGTCCT 
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AGTTGACCCA TCAGGCAAAC TTTTGGAAAC TTATGGTGTC CGTTCTTACC CAACCCAAGC 
CTTTATAGAC AAAGAAGGCA AGCTGGTCAA AACACATCCA GGATTCATGG AAAAAGATGC 
AATTTTGCAA ACTTTGAAGG AATTAGCC 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: . , 

Cys Ser Gly Lys Ser Val Thr Ser Glu His Gin Thr~ Lys Asp Glu Met 

1 -- ---- 5. - . 10 15 

Lys Thr Glu Gin Thr Ala Ser Lys Thr Ser Ala Ala Lys Gly Lys Glu 
20 25 30 

Val Ala Asp Phe Glu Leu Met Gly Val Asp Gly Lys Thr Tyr Arg Leu 

35 40 45 



Ser Asp Tyr Lys Gly Lys Lys Val Tyr Leu Lys Phe Trp Ala Ser Trp 
50 55 60 

Cys Ser lie Cys Leu Ala Ser Leu Pro Asp Thr Asp Glu lie Ala Lys 
65 70 75 80 

Glu Ala Gly Asp Asp Tyr Val Val Leu Thr Val Val Ser. Pro Gly His 
85 90 95 

Lys Gly Glu Gin Ser Glu Ala Asp Phe Lys Asn' Trp Tyr Lys Gly Leu 
100 105 110 

Asp Tyr Lys Asn Leu Pro Val Leu Val Asp Pro Ser Gly Lys Leu Leu 
115 120 125 

Glu Thr Tyr Gly Val Arg Ser Tyr Pro Thr Gin Ala Phe lie Asp Lys 
130 135 140 

Glu. Gly Lys Leu Val Lys Thr His Pro Gly Phe Met Glu Lys Asp Ala 
145 ■ - 150 . 155 160 

lie Leu Gin Thr Leu Lys Glu Leu Ala 
165 

(2) INFORMATION FOR SEQ ID NO: 211: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 994 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 
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CTCGCAAATT GAAAAGGCGG C AGTTAGC C A AGGAGGAAAA GCAGTGAAAA AAACAGAAAT 60 

TAGTAAAGAC GCAGACTTGC ACGAAATTTA TCTAGCTGGA GGTTGTTTCT GGGGAGTGGA 12 0 

GGAATATTTC TCACGTGTTC" CCGGGGTGAC- GGATGCCGTT TCAGGCTATG CAAATGGTAG . 18 0 

AGGAGAAACA ACCAAGTACG AATTGATTAA CCAAACAGGT CATGCAGAAA CCGTCCATGT 240 

CACCTATGAT GCCAAGCAAA TTTCTCTCAA GGAAATCCTG CTTCACTATT TCCGCATTAT 3 00 
CAATCCAACC AGCAAAAATA AACAAGGAAA TGATGTGGGG ACCCAGTACC GTACTGGTGT . 3 60 

TTATTACACA GATGACAAGG ATTTGGAAGT GATTAACCAA GTCTTTGATG AGGTGGCTAA 420 

GAAATACGAT CAACCTCTAG CAGTTGAAAA GGAAAACTTG AAGAATTTTG TGGTGGCTGA 480 

GGATTACCAT CAAGACTATC TCAAGAAAAA TCCAAATGGC TACTGCCATA TCAATGTTAA 540 

TCAGGCGGCG TATCCTGTCA TTGATGCCAG CAAATATCCA AAACCAAGTG ATGAGGAATT 600 

GAAAAAGACC CTGTCACCTG AGGAGTATGC AGTTACCCAG GAAAATCAAA CAGAACGAGC 660 

TTTCTCAAAC CGTTACTGGG ATAAATTTGA ATCCGGTATC TATGTGGATA TAGCAACTGG 7 20 

GGAACCTCTC TTTTCATCAA AAGACAAATT TGAGTCTGGT TGTGGCTGGC C T AGTTTT AC 780 

CCAACCCATC AGTCCAGATG TTGTCACCTA CAAGGAAGAT AAGTCCTACA ATATGACGCG 840 

TATGGAAGTG CGGAGCCGAG TAGGAGATTC TCACCTTGGG CATGTCTTTA CGGATGGTCC 900 

ACAGGACAAG GGCGGCTTAC GTTACTGTAT CAATAGCCTC TCTATCCGCT TTATTC CCAA 960 

AGAC C AAATG GAAGAAAAAG GTACGCTTAT TTAC .994 



(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

' (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

Ser Gin lie Glu Lys Ala Ala Val Ser Gin Gly Gly Lys Ala Val Lys 
1 5 10 15 

Lys Thr Glu lie Ser Lys Asp Ala Asp Leu His Glu lie Tyr Leu Ala - 
20 2 5 3 0 

Gly Gly Cys Phe Trp Gly Val Glu Glu Tyr Phe Ser Arg Val Pro Gly 
35 40"~ 45 

Val Thr Asp Ala Val Ser Gly Tyr Ala Asn Gly Arg Gly Glu Thr. Thr 
50 55 60 ~ 

Lys Tyr Glu Leu lie Asn Gin Thr Gly. His Ala Glu Thr Val His Val 
65 70 75 80_ 



Thr Tyr Asp Ala Lys Gin lie Ser Leu Lys Glu lie Leu Leu His Tyr 
85 90 - 95 



Phe Arg lie lie Asn Pro Thr Ser Lys Asn Lys Gin Gly Asn Asp Val_ 
100 105 11.0__- 

Gly Thr Gin Tyr Arg Thr Gly Val Tyr Tyr Thr Asp Asp Lys Asp Leu 
115 120 125 

Glu Val- lie Asn Gin Val Phe Asp Glu Val Ala Lys Lys Tyr Asp Gin 

130 135 140 

Pro Leu Ala Val Glu Lys Glu Asn Leu Lys Asn Phe Val Val Ala Glu 

145 150 155' 160 

Asp Tyr His Gin Asp Tyr Leu Lys, Lys Asn Pro Asn Gly Tyr Cys His 

165 170 175 v 

lie Asn Val Asn Gin Ala Ala Tyr Pro Val lie Asp Ala Ser Lys 'Tyr 
180 185 190 



Pro Lys Pro Ser Asp Glu Glu Leu Lys Lys Thr Leu Ser Pro .Glu Glu 
195 200 205 

Tyr Ala Val Thr Gin Glu Asn Gin Thr Glu Arg Ala Phe Ser Asn Arg 
210 215 220 



Tyr Trp Asp Lys Phe Glu 
225 230 

Glu Pro Leu Phe Ser Ser 
245 

Pro Ser Phe Thr Gin Pro 
260 

Asp Lys- Ser Tyr Asn Met 
275 

Asp Ser His Leu Gly His 
290 

Gly Leu Arg Tyr Cys lie 
305 310 

Asp Gin Met Glu Glu "Lys 
325 

(2) INFORMATION FOR SEQ ID NO: 213: 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 625 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 
TTGTCAGTCA GGTTCTAATG GTTCTCAGTC TGCTGTGGAT GCTATCAAAC AAAAAGGGAA ' 
ATTAGTTGTG GCAACCAGTC CTGACTATGC ACCCTTTGAA TTTCAATCAT TGGTTGATGG 



Ser Gly lie Tyr Val 
235 

Lys Asp Lys Phe Glu 
250 

lie Ser Pro Asp Val 
265 

Thr Arg Met Glu Val 
280 

Val Phe Thr Asp Gly 
295 

Asn Ser Leu Ser lie 
315 

Gly Thr Leu lie Tyr 
330 



Asp lie Ala Thr Gly 
240 

Ser Gly Cys Gly. Trp 
255 

Val Thr Tyr Lys Glu 
270 

Arg Ser Arg Val Gly 
285 

Pro Gin Asp Lys Gly 
300 

Arg PHe lie Pro Lys 
320 
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AAAGAACCAG GTAGTCGGTG CAGACATCGA CATGGCTCAG GCTATGGCTG ATGAACTTGG 180 

GGTTAAGTTG GAAATCTCAA GCATGAGTTT TGACAATGTT TTGACCAGTC TTCAAACTGG 240 

TAAGGCTGAC CTAGCAGTTG CAGGAATTAG TGCTAC TGAC GAGAGAAAAG AAGTCTTTGA 3 00 

TTTTTCAATC CCATACTATG AAAACAAGAT TAGTTTCTTG GTTC GTAAGG CTGATGTGGA 3 60 

AAAATACAAG GATTTAACTA GCCTAGAAAG TGCTAATATT GCAGCCCAAA AAGGGACTGT 420 

TCCAGAATCA ATGGTCAAGG AACAATTGCC AAAAGTTCAA TTAACTTCCC TAACTAATAT 480 

GGGTGAAGCA GTCAATGAAT TGCAGGCTGG AAAAATAGAT GCTGTTCATA TGGATGAGCC 540 

TGTTGCACTT AGTTATGCTG CTAAAAACGC TGGCTTAGCT GTCGCAACTG TCAGCTTGAA 600 

GATGAAGGAC GGCGACGCCA ATGCC 625 
(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids . 

(B) TYPE: amino acid 

tC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:214: 

Cys Gin Ser Gly Ser Asn Gly Ser Gin Ser Ala Val Asp Ala lie Lys 

i 5 io is : 

Gin Lys Gly Lys Leu Val Val Ala Thr Ser Pro Asp Tyr Ala Pro Phe 
20 .25 30 

Glu Phe Gin Ser Leu Val Asp Gly Lys Asn Gin Val Val Gly Ala Asp 
35 40 45 

lie Asp Met Ala Gin Ala lie Ala Asp Glu Leu Gly Val Lys Leu Glu 
50 55 60 

lie Ser Ser Met Ser Phe. Asp Asn Val Leu Thr Ser Leu Gin Thr Gly 
65 70 75 80 

Lys Ala Asp Leu Ala Val Ala Gly lie Ser Ala Thr Asp Glu Arg Lys 
85 90 .95 

Glu Val Phe Asp Phe Ser He Pro Tyr ■ Tyr Glu Asn Lys He Ser Phe 
100 105 110 

Leu Val Arg Lys Ala Asp Val Glu Lys Tyr Lys Asp. Leu Thr Ser Leu 
115 : 120 . 125 

. Glu Ser Ala Asn He Ala Ala Gin Lys Gly Thr Val Pro Glu Ser Met 
130 135 140 

Val Lys Glu Gin Leu Pro Lys Val Gin Leu Thr Ser Leu Thr Asn Met _ . 
145. 150 155 * 160" 

Gly Glu Ala Val Asn Glu Leu Gin Ala Gly Lys He Asp Ala Val His 
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165 170 175 

Met Asp Glu Pro Val Ala Leu Ser Tyr Ala Ala Lys Asn Ala Gly Leu 
180 185 / 190 

Ala Val Ala Thr Val Ser Leu Lys Met Lys Asp Gly Asp Ala Asn Ala 
195 200 205 

(2) INFORMATION FOR SEQ- ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

GGAAACTTCA CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAAGGAAA AAGAAGTTGT 60 

TAGTAAAAAT CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TCAAAGAAGA 120 

AAATTCCAAT AAATC CCAAG GAGATTATAC GGACTCATTT . GTGAAT AAAA ACACAGAAAA 180 

TCCCAAAAAA GAAGATAAAG TTGTC TATAT TGCTGAATTT AAAGATAAAG AATCTGGAGA 240 

AAAAGCAATC AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CTTATGATAG 3 00 

AATTTTTAAC GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TTAAACAAAT 3 60 

AGAAGGTATT TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA ATCATGCCAG 420 

AAAGGAAATT GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CTCCGTTTGG 48 0 

GAAAAATTTT GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CAGATTATAG 540 

ACATAAGGCT ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AAAAAGAAGA 600 

CTTAAAAGGC ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CGTTCAATTA 660 

TTATAATGGT GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT ATTTTGACCC 720 

ACATGGGATG CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG ACATCAAAAA 780 

CTTTAACGGC ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AAATGTATTC 84 0 

TGACGCAGGA TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AAGATTCTAT 900 

CAAACACAAC GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GTC TTGTAGG - 960 

TGAGAAATAT TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TGGTTGTCGC 102 0 

TAC GGGTAAC TATGC GACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CAAATAATCA 108 0 

TCTGAAAATG ACCGACACTG GAAATGTAAC . ACGAACTGCA GCACATGAAG ATGCGATAGC 1140 

GGTCGCTTCT GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GTGGAGAAAG ' 120 0 

' TTTTAAATAC AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CAAATGAAGA 12 60 
TGGAACAAAA GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AAGACCAAGA " 13 2 0 
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TTTGATAGGT TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT ATACAAAGGA 1380 

TTTAAAAAAT GCTTTTAAAA AAGC TATGG A TAAGGGTGCA CGCGCCATTA TGGTTGTAAA 1440 

TACTGTAAAT TACTACAATA GAGATAATTG GACAGAGCTT CCAGCTATGG GATATGAAGC 1500 

GGATGAAGGT ACTAAAAGTC AAGTGTTTTC AATTTCAGGA GATGATGGTG TAAAGCTATG 1560 

GAACATGATT AATCCTGATA AAAAAACTGA AGTCAAAAGA AATAATAAAG AAGATTTTAA 1620 

AGATAAATTG GAGCAATACT ATCCAATTGA TATGGAAAGT TTT AATTC C A ACAAACC GAA 1680 

TGTAGGTGAC GAAAAAGAGA TTGACTTTAA GTTTGCACCT GACACAGACA AAGAACTCTA 1740 

. TAAAGAAG AT ATCATCGTTC CAGCAGGATC TACATCTTGG GGGCCAAGAA TAGATTTACT 1800 

TTTAAAACCC GATGTTTCAG CACCTGGTAA AAATATTAAA TCCACGCTTA ATGTTATTAA 1860 

TGGCAAATCA ACTTATGGCT ATATGTCAGG AACTAGTATG - GCGACTCCAA TCGTGGCAGC 1920 

TTCTACTGTT TTGATTAGAC CGAAATTAAA GGAAATGCTT GAAAGACCTG TATTGAAAAA 1980 

TCTTAAGGGA GATGACAAAA TAGATCTTAC AAGTCTTACA AAAATTGCCC TACAAAATAC 2040 

TGCGCGACCT ATGATGGATG CAACTTCTTG GAAAGAAAAA AGTCAATACT TTGCATCACC 2100 

TAGACAACAG GGAGCAGGCC TAATTAATGT GGCCAATGCT TTGAGAAATG AAGTTGTAGC 2160 

AAC TTTC AAA AACACTGATT CTAAAGGTTT GGTAAAC TC A TATGGTTCCA TTTCTCTTAA 2220 

AGAAATAAAA GGTGATAAAA AATACTTTAC AATCAAGCTT CACAATACAT CAAACAGACC 2280 

TTTGACTTTT AAAGTTTCAG CATCAGCGAT AACTACAGAT TCTCTAACTG ACAGATTAAA 23 40 

ACTTGATGAA ACATATAAAG ATGAAAAATC TCCAGATGGT AAGCAAATTG TTCCAGAAAT 24 00 

TCACCCAGAA AAAGTCAAAG GAGCAAATAT CACATTTGAG CATGATACTT TCACTATAGG 24 60 

CGCAAATTCT AGCTTTGATT TGAATGCGGT TATAAATGTT GGAGAGGCCA AAAACAAAAA 2520 

TAAATTTGTA GAATCATTTA TTCATTTTGA GTCAGTGGAA . GCGATGGAAG CTCTAAACTC 2580 

CAGCGGGAAG AAAATAAACT TCCAACCTTC TTTGTC GATG CCTCTAATGG GATTTGCTGG 2 640 

GAATTGGAAC CACGAACCAA TCCTTGATAA ATGGGCTTGG GAAGAAGGGT CAAGATCAAA 27 00 

AACACTGGGA GGTTATGATG ATGATGGTAA ACCGAAAATT CCAGGAACCT TAAATAAGGG 2760 

AATTGGTGGA GAACATGGTA TAGATAAATT TAATCCAGCA GGAGTTATAC AAAATAGAAA 2820 

AGATAAAAAT ACAACATCCC TGGATCAAAA TCCAGAATTA TTTGCTTTCA ATAACGAAGG 2880 

GATCAACGCT CCATCATCAA GTGGTTCTAA GATTGCTAAC ATTTATCCTT TAGATTCAAA 2 940 

TGGAAATCCT CAAGATGCTC AACTTGAAAG AGGATTAACA CCTTCTCCAC TTGTATTAAG 3 000 

AAGTGCAGAA GAAGGATTGA TT ,3 022 

( 2 ) INFORMATION FOR SEQ ID NO:216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 

Glu Thr Ser Gin Asp Phe Lys Glu Lys Lys Thr. Ala Val lie Lys Glu 
1 5 10 15 

Lys Glu Val Val Ser Lys Asn Pro Val lie Asp Asn Asn Thr Ser Asn 
20 25 30 

Glu Glu Ala Lys lie Lys Glu Glu Asn Ser Asn Lys Ser Gin Gly Asp 
35 40 45 

Tyr Thr Asp Ser Phe Val Asn Lys Asn Thr Glu Asn Pro Lys Lys Glu 
50 55 60 

Asp Lys Val Val Tyr lie Ala Glu Phe Lys Asp Lys Glu Ser Gly Glu 
65 70 75 80 



Lys Ala lie Lys Glu Leu Ser Ser Leu Lys Asn Thr Lys Val Leu Tyr 
85 90 95 

Thr Tyr Asp Arg lie Phe Asn Gly Ser Ala lie Glu Thr Thr Pro Asp 
100 105 110 

Asn Leu Asp Lys lie Lys Gin lie Glu Gly lie Ser Ser Val Glu Arg 
115 120 v - 125 

Ala Gin Lys Val Gin Pro Met Met Asn His Ala Arg Lys Glu lie Gly 
130 135 140 

Val Glu Glu Ala lie Asp Tyr Leu Lys Ser lie Asn Ala Pro Phe Gly 
145 150 155 160 

Lys Asn Phe Asp Gly Arg Gly Met Val lie Ser Asn lie Asp Thr Gly 
165 r 170 175 

Thr Asp Tyr Arg His Lys Ala Met Arg lie Asp Asp Asp Ala Lys Ala 
180 . 185 190 

Ser Met Arg Phe Lys Lys Glu Asp Leu Lys Gly Thr Asp Lys Asn Tyr 
195 200 205 



Trp Leu Ser Asp Lys lie Pro His Ala Phe Asn Tyr Tyr Asn Gly Gly 

210 215 . 220 

Lys lie Thr Val Glu Lys Tyr Asp Asp Gly Arg Asp Tyr Phe Asp Pro 

225 230 ' 235 240 

His Gly Met His lie Ala Gly lie Leu Ala Gly Asn Asp Thr Glu Gin 

245 250 255 

Asp lie Lys Asn Phe Asn Gly lie Asp Gly lie Ala Pro Asn Ala Gin 

260 265 270 



lie Phe Ser Tyr Lys Met Tyr Ser Asp Ala Gly Ser Gly Phe Ala Gly 
275 . . 280 285 



* 

320 



Asp Glu Thr Met 
290 

Asp Val Val Ser 
305 

Glu Lys Tyr Trp 



Met Val Val Ala 
340 

Trp Asp Leu Val 
355 

Val Thr Arg Thr 

370 . 

Lys Asn Gin Thr 
385 ' 



Phe Lys Tyr Arg 



Thr Asn Glu Asp 
420 

lie Gly Lys Gly 
435 

Lys lie Ala Val 
450 

Phe Lys Lys Ala 
465 

Thr Val Asn Tyr 



Gly Tyr Glu Ala 
500 

Gly Asp Asp Gly 
515 

Thr Glu Val Lys 
530 

Gin Tyr Tyr Pro 
545 

Val Gly Asp Glu 



Lys Glu Leu Tyr 
580 

Trp Gly Pro Arg 



Phe Hi's Ala lie 
295 

Val Ser Ser Gly 
310 . - 

Gin Ala lie Arg 
325 

Thr Gly Asn Tyr 



Ala Asn Asn His 
360 

Ala Ala His Glu 
375 

Val Glu Phe Asp 
390 

Asn lie Gly Ala 
405 

Gly Thr Lys Ala 



Gin Asp Gin Asp 
440 

Met Asp Arg lie 
455 

Met Asp Lys Gly 
470 

Tyr Asn Arg Asp 
485 

Asp Glu Gly Thr 



Val Lys Leu Trp 
520 



Arg Asn Asn Lys 
535 

lie Asp Met Glu 

550 . 

Lys Glu lie Asp 
565 

Lys Glu Asp lie 



lie Asp Leu Leu 
600 



Glu Asp Ser lie 
300 

Phe Thr Gly Thr 
315 

Ala Leu Arg Lys 
330 

Ala Thr Ser Ala 

345 . 

Leu Lys Met Thr 



Asp Ala lie Ala 
380 

Lys Val Asn lie 
395 

Phe Phe Asp Lys 
410 

Pro Ser Lys Leu 
425 

Leu lie Gly Leu 



Tyr Thr Lys Asp 
460 

Ala Arg-Ala lie 
475 

Asn Trp Thr Glu 
490 

Lys Ser Gin Val 
505 

Asn Met lie Asn 



Glu Asp Phe Lys 
540 

Ser Phe Asn Ser 
555 

Phe Lys Phe Ala 
570 

lie Val Pro Ala 
585 ' 

Leu Lys Pro Asp 



Lys His Asn Val 



Gly Leu Val Gly 
320 

Ala Gly lie Pro 
335 

Ser Ser Ser Ser 
350 

Asp Thr Gly Asn 
365 

Val Ala Ser Ala 



Gly Gly Glu Ser 
400 

Ser Lys lie Thr 
415 

Lys Phe Val Tyr 
430 

Asp Leu Arg Gly 
445 

Leu Lys Asn Ala 



Met Val Val Asn 
' 480 

Leu Pro Ala Met 
495 

Phe Ser lie Ser 
510 

Pro Asp Lys Lys 
525 

Asp Lys Leu Glu 



Asn Lys Pro Asn 
560 

Pro Asp Thr Asp 
57 5 

Gly Ser Thr Ser 
590 

Val Ser Ala . Pro 
605 



Gly Lys Asn lie Lys Ser Thr Leu Asn Val lie Asn Gly Ly.s Ser Thr 
610 615 620 



Tyr Gly Tyr Met 
625 

Ser Thr Val Leu 



Val Leu Lys Asn 
660 

Thr Lys lie Ala 
675 

Ser Trp Lys Glu 
690 

Ala Gly Leu lie 
705 

Thr Phe Lys Asn 



lie Ser Leu Lys 
740 

Leu His Asn Thr 
755 

Ala lie Thr Thr 
770 

Tyr Lys Asp Glu 
785 

His Pro Glu Lys 



; Phe Thr lie Gly 
820 

Val Gly Glu Ala 
835 

Phe Glu Ser Val 
850 

lie Asn Phe Gin 
865 

Asn Trp Asn His 



Ser Arg Ser Lys 
900 

lie Pro Gly Thr 
915 

Lys Phe Asn Pro 
930 



Ser Gly Thr Ser 
630 

lie Arg Pro Lys 
645 

Leu Lys Gly Asp 



Leu Gin Asn Thr 
680 

Lys Ser Gin Tyr 
695 

Asn Val Ala Asn 
710 

Thr Asp Ser Lys 
725 

Glu lie Lys Gly 



Ser Asn Arg Pro 
760 

Asp Ser Leu Thr 
775 

Lys Ser Pro Asp 
790 

Val Lys Gly Ala 
805 

Ala Asn Ser Ser 



Lys Asn Lys Asn 
840 

Glu Ala Met Glu 
855 

Pro Ser Leu Ser 
870 

Glu Pro lie Leu 
885 

Thr Leu Gly Gly 



Leu Asn Lys Gly 
920 

Ala Gly Val- lie 
935 



Met Ala Thr Pro. 
635 

Leu Lys Glu Met 
650 

Asp Lys lie Asp 
665 

Ala Arg Pro Met 



Phe Ala Ser Pro 
700 

Ala Leu Arg Asn 
715 

Gly Leu Val Asn 
730 

Asp Lys Lys Tyr 
745 

Leu Thr Phe Lys 



Asp Arg Leu Lys 
780 

Gly Lys. Gin lie 
795 

Asn lie Thr Phe 

'810 

Phe Asp Leu Asn 
825 

Lys Phe Val Glu 



Ala Leu Asn Ser 
860 

Met Pro Leu Met 
875 

Asp Lys Trp Ala 
890 

Tyr Asp Asp Asp 
905 

lie Gly Gly Glu 



Gin Asn Arg Lys 
. 940 



lie Val Ala Ala 
640 

Leu Glu Arg Pro 
655 

Leu Thr Ser Leu 
670 

Met Asp Ala 'Thr 
685 

Arg Gin Gin Gly 



Glu Val Val Ala 
.720 

Ser Tyr Gly -Ser 
735 

Phe Thr lie Lys 
750 

Val Ser Ala Ser 
765 

Leu Asp Glu Thr 



Val Pro Glu lie 
800 

Glu His Asp Thr 
815 

Ala Val lie Asn 
830 

Ser Phe He His 
845 

Ser Gly Lys Lys 



Gly Phe Ala Gly 
880 

Trp Glu Glu Gly 
895 

Gly Lys Pro Lys 
910 

His Gly He Asp 
925 

Asp Lys Asn Thr 



Thr_Ser. Leu Asp Gin Asn Pro Glu Leu Phe Ala Phe Asn Asn Glu Gly 
945 950 955 960 
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lie Asn Ala Pro Ser Ser Ser Gly Ser Lys lie Ala Asn lie Tyr Pro 
965 970 975 

Leu Asp Ser Asn Gly Asn Pro Gin Asp^Ala Gin Leu Glu Arg Gly Leu 
980 985 ' 990 

Thr Pro Ser Pro Leu Val Leu Arg Ser Ala Glu Glu Gly Leu lie 
995 1000 1005 

(2) INFORMATION FOR. SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : ' 1696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 
TGTGGTCGAA GTTGAGACTC CTCAATCAAT AACAAATCAG GAGCAAGCTA GGACAGAAAA 60 
CCAAGTAGTA GAGACAGAGG AAGCTCCAAA AGAAGAAGCA C CTAAAAC AG AAGAAAGTCC 120 
AAAGGAAGAA CCAAAATCGG AGGTAAAACC TACTGACGAC . ACCCTTCCTA AAGTAGAAGA 180 
GGGGAAAGAA GATTCAGCAG AACCAGCTCC AGTTGAAGAA GTAGGTGGAG AAGTTGAGTC 240 
AAAACCAGAG GAAAAAGTAG CAGTTAAGCC AGAAAGTCAA CCATCAGACA AACCAGCTGA 3 00 

GGAATCAAAA GTTGAACAAG C AGGTGAAC C AGTCGCGCCA AGAGAAGACG AAAAGGCACC 3 60 

AGTCGAGCCA GAAAAGCAAC CAGAAGCTCC TGAAGAAGAG AAGGCTGTAG AGGAAACACC 420 
GAAACAAGAA GAGTCAACTC CAGATACCAA GGCTGAAGAA ACTGTAGAAC CAAAAGAGGA 480- 
GACTGTTAAT CAATCTATTG AAC AAC C AAA AGTTGAAACG CCTGCTGTAG AAAAACAAAC 540 
AGAACCAACA GAGGAACCAA AAGTTGAACA AGCAGGTGAA CCAGTCGCGC CAAGAGAAGA 600 
CGAACAGGCA CCAACGGCAC CAGTTGAGCC AGAAAAGCAA C C AG AAGTTC CTGAAGAAGA 660 
GAAGGCTGTA GAGGAAACAC CGAAACCAGA AGATAAAATA AAGGGTATTG GTACTAAAGA 720 
ACCAGTTGAT AAAAGTGAGT TAAATAATCA AATTGATAAA GCTAGTTCAG TTTCTCCTAC 780 
TGATTATTCT. ACAGCAAGTT ACAATGCTCT TGGACCTGTT TTAGAAACTG CAAAAGGTGT 840 
CTATGCTTCA GAGCCTGTAA AACAGCCTGA GGTAAATAGC GAGACAAATA AACTTAAAAC 900 
GGCTATTGAC GCTCTAAACG TTGATAAAAC TGAATTAAAC AATACGATTG CAGATGCAAA 960 

AACAAAGGTA AAAGAACATT ACAGTGATAG AAGTTGGCAA AACCTCCAAA CTGAAGTTAC 1020 

AAAGGCTGAA AAAGTTGCAG CTAATACAGA TGCTAAACAA AGTGAAGTTA ACGAAGCTGT 1080 

TGAAAAATTA ACTGCAACTA TTGAAAAATT GGTTGAATTA TCTGAAAAGC CAATATTAAC 1140 

ATTGACTAGT ACCGATAAGA AAATATTGGA ACGTGAAGCT GTTGCTAAGT ATACTCTAGA 1 20 0 

AAATCAAAAC AAAACAAAAA TCAAATCAAT CACAGCTGAA TTGAAAAAAG GAGAAGAAGT 12 60 
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TATTAATACT GTAGTCCTTA C AG AT G AC AA GGTAACAACA ■ GAAACTATAA GCGCTGCATT 132 0 

TAAGAAC CTA GAGTACTACA AAGAATACAC CCTATCTACA ACTATGATTT ACGACAGAGG 138 0 

TAACGGTGAA GAAACTGAAA CTCTAGAAAA TCAAAATATT CAATTAGATC TTAAAAAAGT 1440 

TGAGCTTAAA AATATTAAAC GTACAGATTT AATCAAATAC GAAAATGGAA AAGAAACTAA 150.0 

TGAATCACTG ATAACAACTA TTC CTGATGA TAAGAGCAAT TATTATTTAA AAATAACTTC 1560 

AAATAATCAG AAAACTACAT . TACTAGCTGT TAAAAATATA GAAGAAACTA CGGTTAACGG 162 0 

AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA C TGCTGAT AA 1680 

TAAATTTGAA GAAGAA 1696 

(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

■ (A) LENGTH: 565 amino acids - - . - . 

(B) TYPE: amino acid -- ---- 1 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218: 

Val Val Glu Val Glu Thr Pro Gin Ser lie Thr Asn Gin Glu Gin Ala 
1 5 10 15 

Arg Thr Glu Asn Gin Val Val. Glu Thr Glu Glu Ala Pro Lys Glu Glu 
20 .. 25 30 

Ala Pro Lys Thr Glu Glu Ser Pro Lys Glu Glu Pro Lys Ser Glu Val 
35 40 45 

' Lys Pro Thr Asp Asp Thr Leu Pro Lys Val Glu Glu Gly Lys Glu Asp . 
50 55 60 

Ser Ala Glu Pro Ala Pro Val .Glu Glu Val Gly Gly Glu Val Glu Ser 
65 70 75 80 

Lys Pro Glu Glu Lys Val Ala Val Lys Pro Glu Ser Gin Pro Ser Asp 
85 90 95 

Lys Pro Ala Glu Glu Ser Lys Val Glu Gin Ala Gly Glu Pro Val Ala 
100 105 110 

Pro Arg. Glu Asp Glu Lys Ala Pro Val Glu Pro Glu Lys Gin Pro Glu 
115 120 125. 

Ala Pro Glu Glu Glu Lys Ala Val Glu Glu Thr Pro Lys Gin Glu Glu 
130 135 140 . 

Ser Thr Pro Asp Thr Lys Ala Glu Glu Thr Val Glu Pro Lys Glu Glu 
' 145 150 155 160 

Thr Val Asn Gin Ser He Glu Gin Pro Lys Val Glu Thr Pro Ala Val 

165 170 ' • 17 5.. 



Glu Lys Gin Thr Glu Pro Thr Glu Glu Pro Lys Val Glu Gin Ala Gly 



180 
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Glu Pro Val Ala Pro Arg Glu Asp Glu Gin Ala Pro Thr Ala Pro Val 

195 _ 200 ..... 205 

Glu Pro Glu Lys Gin Pro Glu Val Pro Glu Glu Glu Lys Ala Val Glu 
210 215 220 

Glu Thr Pro Lys Pro Glu Asp Lys lie Lys Gly lie Gly Thr Lys Glu 
225 230 235 .240 

Pro Val Asp Lys Ser Glu Leu Asn Asn Gin lie Asp Lys Ala Ser Ser 
245 250 ■ 255 

Val Ser Pro Thr Asp Tyr Ser Thr Ala Ser Tyr Asn Ala Leu Gly Pro 
260 265 .270 

Val Leu Glu Thr Ala Lys Gly Val Tyr Ala Ser Glu Pro Val Lys Gin 
275 280 285 

Pro Glu Val Asn Ser Glu Thr Asn Lys Leu Lys Thr Ala ^ He Asp Ala- 
290 295 300 

Leu Asn Val Asp Lys Thr Glu Leu Asn Asn Thr He Ala Asp* Ala Lys 
305 310 315 320 

Thr Lys Val Lys Glu His Tyr Ser Asp Arg Ser Trp' Gin Asn Leu Gin 
325 330 335 

Thr Glu Val Thr Lys Ala Glu Lys Val Ala Ala Asn Thr Asp Ala Lys 
340 345 350 

Gin Ser Glu Val Asn Glu Ala Val Glu Lys Leu Thr Ala Thr He Glu 
355 360 / 365 

Lys Leu Val Glu Leu Ser Glu Lys Pro He Leu Thr Leu Thr Ser Thr 
370 375 - 380 

Asp Lys Lys He Leu Glu Arg Glu Ala Val Ala Lys Tyr Thr Leu Glu 
385 390 395 400 

Asn Gin Asn Lys Thr Lys He Lys Ser He Thr Ala Glu Leu Lys Lys 
405 410 415 

Gly Glu Glu Val He Asn Thr Val Val Leu Thr Asp Asp Lys Val Thr 
420 425 430 

Thr Glu Thr He Ser Ala Ala Phe Lys Asn Leu Glu Tyr Tyr Lys Glu 
435 440 445 

Tyr Thr Leu Ser Thr Thr Met lie Tyr Asp Arg Gly Asn Gly Glu Glu 
450 455 460 

Thr Glu Thr Leu Glu Asn Gin Asn, He Gin Leu Asp Leu Lys Lys Val 
465 470 475 , 480 

Glu Leu Lys Asn He Lys Arg Thr Asp Leu lie Lys Tyr Glu Asn Gly 
485 490 495 

Lys Glu Thr Asn Glu Ser Leu He Thr Thr He Pro Asp Asp Lys Ser- 
500 " 505 510 



Asn Tyr Tyr Leu Lys He Thr Ser Asn Asn Gin Lys Thr Thr Leu Leu 
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520 



525 



Ala Val Lys Asn lie Glu Glu Thr Thr Val Asn Gly Thr Pro Val Tyr 
5.30 535 , 540 

Lys Val Thr Ala lie Ala Asp Asn Leu Val Ser Arg Thr Ala Asp Asn 
545 550 555 560 

Lys Phe Glu Glu Glu 
565 

(2) INFORMATION FOR SEQ ID NO : 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187-9 base pairs 
(B.) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 
AACACCTGTA TATAAAGTTA CAGCAATCGC AGACAATTTA GTCTCTAGAA CTGCTGATAA 
TAAATTTGAA GAAGAATACG TTCACTATAT TGAAAAACCT AAAGTCCACG AAGATAATGT 
ATATTATAAT TTCAAAGAAT TAGTGGAAG C TATTCAAAAC GATCCTTCAA AAGAATATCG 
TCTGGGACAA TCAATGAGCG CTAGAAATGT TGTTCCTAAT GGAAAATCAT ATATCACTAA 
AGAATTCACA GGAAAACTTT TAAGTTCTGA AGGAAAACAA TTTGCTATTA CTGAATTGGA 
ACATCCATTA TTTAATGTGA TAACAAACGC AACGATAAAT AATGTGAATT TTGAAAATGT 
AGAGATAGAA CGTTCTGGTC AAGATAATAT TGCATCATTA GCCAATACTA TGAAAGGTTC 
TTCAGTTATT ACAAATGTCA AAATTACAGG CACACTTTCA GGTC GTAATA ATGTTGCTGG 
ATTTGTAAAT AATATGAATG ATGGAACTCG TATTGAAAAT GTTGCTTTCT TTGGCAAACT 
ACACTCTACA AGTGGAAATG GCTCTCATAC AGGGGGAATT GCAGGTACAA ACTATAGAGG 
AATTGTTAGA AAAGCATATG TTGATGCTAC TATTACAGGA AACAAAACAC GCGCCAGCTT , 
GTTAGTTCCT AAAGTAGATT ATGGATTAAC TCTAGACCAT CTTATTGGTA CAAAAGCTCT 
CCTAACTGAG TCGGTTGTAA AAGGTAAAAT AGATGTTTCA AATCCAGTAG AAGTTGGAGC 
AATAGCAAGT AAGACTTGGC CTGTAGGTAC GGTAAGTAAT TCTGTCAGCT ATGCTAAGAT 
TATCCGTGGA GAGGAGTTAT TCGGCTCTAA CGACGTTGAT GATTCTGATT ATGCTAGTGC 
TCATATAAAA GATTTATATG CGGTAGAGGG ATATTCGTCA GGTAATAGAT CATTTAGGAA 
ATC TAAAAC A TTTACTAAAT TAACTAAAGA ACAAGCTGAT GCTAAAGTTA CTACTTTCAA 
TATTACTGCT GATAAATTAG AAAGTGATCT ATCTCCTCTT GCAAAACTTA ATGAAGAAAA 
AGCCTATTCT AGTATTCAAG ATTATAACGC TGAATATAAC CAAGCCTATA AAAATCTTGA 
AAAATTAATA C C ATTC T AC A ATAAAGATTA TATTGTATAT CAAGGTAATA AATTAAATAA 
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AGAACACCAT" CTAAATACTA AAGAAGTTCT TTCTGTTACC GCGATGAACA ACAATGAGTT 12 60 

TATCACAAAC CTAGATGAAG CTAATAAAAT TATTGTTCAC TATGCGGACG GTACAAAAGA 132 0 

TTACTTTAAC TTGTCTTCTA GCAGTGAAGG TTTAAGTAAT GTAAAAGAAT ATACTATAAC 138 0 

TGACTTAGGA ATTAAATATA CACCTAATAT CGTTCAAAAA GAT AAC AC T A CTCTTGTTAA 1440 

TGATATAAAA TCTATTTTAG AATCAGTAGA GCTTCAGTCT CAAACGATGT ATCAGCATCT 1500 

AAATCGATTA GGTGACTATA GAGTTAATGC AATCAAAGAT TTATATTTAG AAGAAAGCTT 1560 

CACAGATGTT AAAGAAAACT TAACAAACCT AATCACAAAA TTAGTTCAAA ACGAAGAACA 162 0 

TCAACTAAAT GATTCTCCAG CTGCTCGTCA AATGATTCGT GATAAAGTCG AGAAAAACAA 168 0 

AGCAGCTTTA TTACTAGGTT TAACTTACCT AAATCGTTAC TATGGAGTTA AATTTGGTGA 174 0 

TGTTAATATT AAAGAATTAA- TGCTATTCAA ACCAGATTTC TATGGTGAAA AAGTTAGCGT 1800 

J ATTAGACAGA TTAATTGAAA TCGGTTCTAA AGAGAACAAC ATTAAAGGTT CACGTACATT 1860 

.11; CGACGCATTC GGTCAAGTA 1879 

(2) INFORMATION FOR SEQ ID NO: 22 0: 

(i) SEQUENCE "CHARACTERISTICS : 
i=£ (A) LENGTH: 62 6 amino acids 

3; (B) TYPE: amino acid 

i~ t (C) STRANDEDNESS : single 

2 <E>) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

M; (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

Thr Pro Val Tyr Lys Val Thr Ala lie Ala Asp Asn Leu Val Ser Arg 
1 ' 5 10 15 

Thr Ala Asp Asn Lys Phe Glu Glu Glu Tyr Val His Tyr lie Glu Lys 
20 25 30 

Pro Lys Val His Glu Asp Asn Val Tyr Tyr Asn Phe Lys Glu Leu Val 
35 40 45 

Glu Ala lie Gin Asn Asp Pro Ser Lys Glu Tyr Arg Leu Gly Gin Ser 
50 55 60 - 

Met Ser Ala Arg Asn Val Val Pro Asn Gly Lys Ser Tyr lie Thr Lys ' 
65 70 75 80 

Glu Phe Thr Gly Lys Leu Leu Ser Ser Glu Gly Lys Gin Phe Ala lie 
85 90 95 

- Thr Glu Leu Glu His Pro Leu Phe Asn Val lie Thr Asn Ala Thr. lie 

100 105 110 

Asn Asn Val Asn Phe Glu Asn Val Glu lie Glu Arg Ser .Gly Gin Asp 
: 115 " 120 - 125 

Asn lie Ala Ser Leu Ala Asn - Thr* Met Lys Gly Ser Ser Val lie Thr 
130 135 140 
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Asn Val Lys lie Thr Gly Thr Leu Ser Gly Arg Asn Asn Val Ala Gly 
145 150 155 160 

Phe Val Asn Asn Met Asn Asp Gly Thr Arg lie Glu Asn Val Ala Phe 
165 170 175 

Phe Gly Lys Leu His Ser Thr Ser Gly Asn Gly Ser His Thr Gly Gly 
180 . 185 190 

lie Ala Gly Thr Asn Tyr Arg Gly lie Val Arg Lys Ala Tyr Val Asp 
195 200 205 

Ala Thr lie Thr Gly Asn Lys Thr Arg Ala Ser Leu Leu Val Pro Lys 
210 215- 220 

Val Asp Tyr Gly Leu Thr Leu Asp His Leu lie Gly Thr Lys Ala Leu 
225 230 235 240 

Leu Thr Glu Ser Val Val Lys Gly Lys lie Asp Val Ser Asn Pro Val 

245 250 — - ■ 255 

Glu Val Gly Ala lie Ala Ser Lys Thr Trp Pro Val Gly Thr Val Ser 
260 265 270 

Asn Ser Val Ser Tyr Ala Lys lie lie Arg Gly Glu Glu Leu Phe Gly 
275 280 285 

Ser Asn Asp Val Asp Asp Ser Asp Tyr Ala Ser Ala His lie Lys Asp 
290 295 300 

Leu Tyr Ala Val Glu Gly Tyr Ser Ser Gly Asn Arg Ser Phe Arg Lys 
.305 310 315 320 



Ser Lys Thr Phe Thr Lys Leu Thr Lys Glu Gin Ala Asp Ala Lys Val 
325 330 335 

Thr Thr Phe Asn lie Thr Ala Asp Lys Leu Glu Ser Asp Leu Ser Pro 
340 345 350 

Leu Ala Lys Leu Asn Glu Glu Lys Ala Tyr Ser Ser lie Gin Asp Tyr 
355 360 365 

Asn Ala Glu Tyr Asn Gin Ala- Tyr Lys Asn Leu Glu Lys Leu He Pro 
370 375 380 

Phe Tyr Asn Lys Asp Tyr He Val Tyr Gin Gly Asn Lys Leu Asn Lys 
385 390 395 400 

Glu His His Leu Asn Thr Lys Glu Val Leu Ser Val Thr Ala Met Asn 
405 410 415 

Asn Asn Glu Phe He Thr Asn Leu Asp Glu Ala Asn Lys lie He - Val 
420 425 430 ' 

His Tyr Ala Asp Gly Thr Lys Asp Tyr. Phe Asn Leu Ser Ser Ser Ser 
435 440 445 

Glu Gly Leu Ser Asn Val Lys Glu Tyr Thr He Thr Asp Leu Gly- Tie 
45.0 455 460 

Lys Tyr Thr Pro Asn He Val__Gln Lys Asp Asn Thr Thr Leu Val Asn 
465 470 475 -480 
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Asp lie Lys Ser lie Leu Glu Ser Val Glu Leu Gin Ser Gin Thr Met 
485 490 495 

Tyr Gin His Leu Asn Arg Leu Gly Asp Tyr Arg Val Asn Ala lie Lys 
500 505 510 

Asp Leu Tyr Leu Glu Glu Ser Phe Thr Asp Val Lys Glu Asn Leu Thr 
515 520 525 

Asn Leu lie Thr Lys Leu Val Gin Asn Glu Glu His Gin Leu Asn Asp 
530 535 540 

Ser Pro Ala Ala Arg Gin Met lie Arg Asp Lys Val Glu Lys Asn Lys 
545 550 555 560 

Ala Ala Leu Leu Leu Gly Leu Thr Tyr Leu Asn Arg Tyr Tyr Gly Val 
565 570 575 

Lys Phe Gly Asp Val Asn lie Lys Glu Leu Met Leu Phe Lys Pro Asp 
580 585 v 590 

Phe Tyr Gly Glu Lys Val Ser Val Leu Asp Arg Leu lie Glu lie Gly 
595 600 605 

Ser Lys Glu Asn Asn lie Lys Gly Ser Arg Thr Phe Asp Ala Phe Gly 
610 615 620 

Gin Val 
625 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS : " .■ 

(A) LENGTH: 1777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

ATTAGACAGA TTAATTGAAA TCGGTTCTAA AGAGAACAAC ATTAAAGGTT C AC GT AC ATT 60 

CGACGCATTC GGTCAAGTAT TGGCTAAATA T AC TAAATC A . GGTAATTTAG ATGCATTTTT 12 0 

AAATTATAAT AGACAATTGT TCACAAATAT AGACAATATG AACGATTGGT TTATTGATGC 180 

TACAGAAGAC CATGTCTACA TCGCAGAACG CGCTTCTGAG GTC G AAG AAA TTAAAAATTC 240 

TAAACATCGT GCATTCGATA ATTTAAAACG AAGTCACCTT AGAAATACTA TACTCCCACT 3 00. 

ACTGAATATT GATAAAGCAC ATCTTTATTT AATTTCAAAT TATAATGCAA TTGCCTTTGG 3 60 
TAGTGCAGAG CGATTAGGTA AAAAATCATT AGAAGATATT AAAGATATCG TTAACAAAGC - 420 

TGCAGATGGT TATAGAAACT ATTATGATTT CTGGTATCGT CTAGCGTCTG ATAACGTTAA 48 0 

ACAACGACTA CTAAGAGATG CTGTTATTCC TATTTGGGAA GGTTATAACG CTCCTGGTGG 54 0 

ATGGGTTGAA AAATATGGCC GCTATAATAC CGACAAAGTA TATACTCCTC TTAGAGAATT 600 
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CTTTGGTCCT ATGGATAAGT ATTATAATTA TAATGGAACA GGAGCTTATG C TGCTAT ATA 660 
TCCTAACTCT GATGATATTA GAACTGATGT AAAATATGTT CATTTAGAAA TGGTTGGTGA . 720 
ATACGGTATT TCAGTTTACA CACATGAAAC AACACACGTC AACGACCGTG CGATTTACTT 780 
AGGTGGCTTT GGACACCGTG AAGGTACTGA TGCTGAAGCA TATGCTCAGG GTATGCTACA 840 
AACTCCTGTT ACTGGTAGTG GATTTGATGA GTTTGGTTCT TTAGGTATTA ATATGGTATT 900 ' 

TAAAC GC AAA AATGATGGGA ATCAGTGGTA TATTACAGAT CCAAAAACTC TAAAAACACG 960 

AGAAGATATT AATAGATATA TGAAGGGTTA TAATGACACT TTAACTCTTC TTGATGAAAT 102 0 

TGAGGCTGAA TCTGTGATTT CTCAACAAAA TAAAGATTTA AATAGTGCAT GGTTCAAAAA 1080 

AAT AG AT AG A GAATACC GTG ATAACAATAA ATTAAATCAA TGGGATAAAA TTCGAAATCT 1140 

AAGTCAAGAA GAGAAAAATG AATTAAATAT TCAATC TGTT AATGATTTAG TTGATCAACA 12 00 

ATTAATGACT AATCGCAATC CAGGTAATGG TATCTATAAA CCCGAAGCAA TTAGCTATAA 1260 

CGATCAATCA CCTTATGTAG GTGTTAGAAT GATGACCGGT ATCTAC GGAG GTAATACTAG 1320 

TAAAGGTGCT CCTGGAGCTG TTTCATTCAA ACATAATGCT TTTAGATTAT GGGGTTACTA 1380 

CGGATACGAA AATGGGTTCT TAGGTTATGC TTCAAATAAA TATAAACAAC AATCTAAAAC 1440 

AGATGGTGAG TCTGTTCTAA GTGATGAATA TATTATCAAG AAAATATCTA ACAATACATT 1500 

TAATACTATT GAAGAATTTA AAAAAGCTTA CTTCAAAGAA GTTAAAGATA AAGCAACGAA 1560 

AGGATTAACA ACATTCGAAG TAAATGGTTC TTCCGTTTCA TCATACGATG ATTTAC TG AC 162 0 

ATTGTTTAAA GAAGCTGTTA AAAAAGATGC CGAAACTCTT AAACAAGAAG CAAACGGTAA 1680 

TAAAACAGTA TCTATGAATA ATACAGTTAA ATTAAAAGAA GCTGTTTATA AGAAAC TTCT 1740 

TCAACAAACA AATAGCTTTA AAACTTCAAT CTTTAAA 1777 



(2) INFORMATION FOR SEQ ID NO: 222: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 592 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: 

Leu Asp Arg Leu He Glu He Gly Ser Lys Glu Asn Asn He Lys Gly 
.1 5 10 15 

Ser Arg Thr Phe Asp Ala Phe Gly Gin Val.Leu Ala Lys Tyr Thr Lys 
20 25 30 

Ser Gly Asn Leu Asp Ala Phe Leu Asn Tyr Asn Arg Gin Leu Phe Thr ' 

35 40 45 

Asn He Asp Asn Met Asn Asp Trp Phe lie Asp Ala Thr Glu Asp His 
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50 .55 .60 

Val Tyr lie Ala Glu Arg Ala Ser Glu Val Glu Glu lie Lys Asn Ser 
65 70 75 80 

Lys His Arg Ala Phe Asp Asn Leu> Lys Arg Ser His Leu Arg Asn Thr 
85 90 95 

lie Leu Pro Leu Leu Asn lie Asp Lys Ala His Leu Tyr Leu lie Ser 
100 105 110 

Asn Tyr Asn Ala lie Ala Phe Gly Ser Ala Glu Arg Leu Gly Lys Lys 
115 120 125 

Ser Leu Glu Asp lie Lys Asp lie Val Asn Lys Ala Ala Asp Gly Tyr 
130 135 140 

Arg Asn Tyr Tyr Asp Phe Trp Tyr Arg Leu Ala Ser Asp Asn Val Lys 
145 150 155 160 

Gin Arg Leu Leu Arg Asp Ala Val lie Pro lie Trp Glu Gly Tyr Asn 
165 170 175 

Ala Pro Gly Gly Trp Val Glu Lys Tyr Gly Arg Tyr Asn Thr Asp Lys 
180 185 190 

Val Tyr Thr Pro Leu Arg Glu Phe Phe Gly Pro Met Asp Lys Tyr Tyr 
195 200 205 

Asn Tyr Asn Gly Thr Gly Ala Tyr Ala Ala lie Tyr Pro Asn Ser Asp 
210 215 220 

Asp lie Arg Thr Asp Val Lys Tyr Val His Leu Glu Met Val Gly Glu 
225 230 235 240 

Tyr Gly lie Ser Val Tyr Thr His Glu Thr Thr His Val Asn Asp Arg 

245 250 . ,255 . 

Ala lie Tyr Leu Gly Gly Phe Gly His Arg Glu Gly Thr Asp Ala Glu 
260 265 270 

Ala Tyr Ala Gin Gly Met Leu Gin Thr Pro Val Thr Gly Ser Gly Phe 
275 280 285 

Asp Glu Phe Gly Ser Leu Gly lie Asn Met Val Phe Lys Arg Lys Asn 
290 295 300 

Asp Gly Asn Gin Trp Tyr lie Thr Asp Pro Lys Thr Leu Lys Thr Arg 
305 310 315 320 

Glu Asp lie Asn Arg Tyr Met Lys Gly Tyr Asn Asp Thr Leu Thr Leu 
325 . 330 335 

Leu Asp Glu lie Glu Ala Glu Ser Val lie Ser Gin Gin Asn Lys Asp 
340 s 345 350 

Leu Asn Ser Ala Trp Phe Lys Lys lie Asp Arg Glu Tyr -Arg Asp Asn 
- 355 360 365 

Asn Lys Leu Asn Gin Trp Asp Lys lie Arg Asn Leu Ser Gin Glu Glu 
370 375 ■ 380 

Lys Asn Glu Leu Asn lie Gin Ser Val Asn Asp Leu Val Asp Gin Gin 
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385 390 395 400 

Leu Met Thr Asn Arg Asn Pro Gly Asn Gly lie Tyr Lys Pro Glu Ala 
405 410 415 

lie Ser Tyr Asn Asp Gin Ser Pro Tyr Val Gly Val Arg Met Met Thr 
420 425 430 

Gly lie Tyr Gly Gly Asn Thr Ser Lys Gly Ala Pro Gly Ala Val Ser 
435 440 445 

Phe Lys His Asn Ala Phe Arg Leu Trp Gly Tyr Tyr Gly Tyr Glu Asn 
450 455 460 

Gly Phe Leu Gly Tyr Ala Ser Asn Lys Tyr Lys Gin Gin Ser Lys Thr 
465 ^ 470 475 480 

Asp Gly Glu Ser Val Leu Ser Asp Glu Tyr lie lie Lys Lys lie Ser 
485 490 495 

Asn Asn Thr Phe Asn Thr lie Glu Glu Phe Lys Lys Ala Tyr Phe Lys 
500 505 510 

Glu Val Lys Asp Lys Ala Thr Lys Gly Leu Thr Thr Phe Glu Val Asn 
515 520 525 

Gly Ser Ser Val Ser Ser Tyr Asp Asp Leu Leu Thr Leu Phe Lys Glu 
530 535 540 

Ala Val Lys Lys Asp Ala Glu Thr Leu Lys Gin Glu Ala Asn Gly Asn 
545 550 555 560 

Lys Thr Val Ser Met Asn Asn Thr Val Lys Leu Lys Glu Ala Val Tyr 
565 570 575 

Lys Lys Leu Leu Gin Gin Thr Asn Ser- Phe Lys Thr Ser lie Phe Lys 
580 585 ' 590 

(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 
TAAGACAGAT GAACGGAGCA AGGTGTTTGA CTTTTCCATT CCCTACTATA CTGCAAAAAA 60 

TAAACTCATT GTCAAAAAAT CTGACTTGAC TACTTATCAG TCTGTAAACG ACTTGGCGCA 120 

GAAAAAGGTT GGAGCGCAGA AAGGTTCGAT TCAAGAGACG ATGGCGAAAG ATTTGCTACA 180 

AAATTCTTCC CTCGTATCTC TGCCTAAAAA TGGGAATTTA ATCACAGATT TAAAATCAGG 240 

ACAAGTGGAT GCCGTTATCT TTGAAGAACC TGTTTCCAAG GGATTTGTGG AAAATAATCC 3 00 

TGATTTAGCA" ATCGCAGACC TCAATTTTGA AAAAGAGCAA GATGATTCCT AC G C GGTAGC 3 60 

CATgAAAAAA GATAGCAAGA AATTGAAGAG GCAGTTCGAT AAAACCATTC AAAAGTTGAA 420 
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GGAGTCTGGG GAATTAGACA AACTCATTGA GGAAGCCTTA " 46 0 

_(2J_ INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

Lys Thr Asp Glu Arg- Ser Lys Val Phe Asp Phe Ser lie Pro Tyr Tyr 
"1 5 .10 15 

-Thr Ala Lys Asn Lys Leu lie Val Lys Lys Ser Asp Leu Thr Thr Tyr 
20 25 30 



Gin Ser Val Asn Asp Leu Ala Gin Lys Lys Val Gly Ala Gin Lys Gly 
35 40 45 

Ser lie Gin Glu Thr Met Ala Lys Asp Leu Leu Gin Asn Ser Ser Leu 
50 55 60 

Val Ser Leu Pro Lys Asn Gly Asn Leu lie Thr Asp Leu Lys Ser Gly 
65 70 75 80 

Gin Val Asp Ala Val lie Phe Glu Glu Pro Val Ser Lys Gly Phe Val 
85 90 95 

Glu Asn Asn Pro Asp Leu Ala lie Ala Asp Leu Asn Phe Glu Lys Glu 
100 105 110 

Gin Asp Asp Ser Tyr Ala Val Ala Met Lys Lys Asp Ser Lys Lys Leu 
.115 120 125 

Lys Arg Gin Phe Asp Lys Thr lie Gin Lys Leu Lys Glu Ser Gly Glu 
130 135 140 

Leu Asp Lys Leu lie Glu Glu Ala Leu 
,145 150 

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
— - (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
CTGTGAGAAT CAAGCTACAC CCAAAGAGAC TAGCGCTCAA AAGACAATCG TCCTTGCTAC 60 



■ AGCTGGCGAC GTGCCACCAT TTGACTACGA AGACAAGGGC "AATCTGACAG GCTTTGATAT 120 
CGAAGTTTTA AAGGCAGTAG ATGAAAAACT CAGCGACTAC GAGATTCAAT TCCAAAGAAC 180 



333 

CGCCTGGGAG AGCATCTTCC CAGGACTTGA TTCTGGTCAC TATCAGGCTG CGGCCAATAA 240 

CTTGAGTTAC ACAAAAGAGC GTGCTGAAAA ATACCTTTAC TCGCTTCCAA TTTCCAACAA 300 

TCCCCTCGTC" CTTGTCAGCA ACAAGAAAAA TCCTTTGACT TCTCTTGACC AGATCGCTGG 3 60 

TAAAACAACA CAAGAGGATA CCGGAACTTC TAACGCTCAA TTCATCAATA ACTGGAATCA 42 0 

GAAACACACT GATAATCCCG CTACAATTAA TTTTTCTGGT GAGGATATTG GTAAACGAAT 480 

CCTAGACCTT GCTAAC GGAG AGTTTGATTT C C TAGTTTTT GACAAGGTAT CCGTTCAAAA 540 

GATTATCAAG GACCGTGGTT TAGACCTCTC AGTCGTTGAT TTACCTTCTG CAGATAGCCC 600 

CAGCAATTAT ATGATTTTCT CAAGCGACCA AAAAGAGTTT AAAGAGCAAT TTGATAAAGC 660 

GCTCAAAGAA CTCTATCAAG ACGGAACCCT TGAAAAACTC AGCAATACCT ATCTAGGTGG " 720 

TTCTTACCTC CCAGATCAAT CTCAGTTACA A -' 7 51 



(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

Cys Glu Asn Gin Ala Thr Pro Lys Glu Thr Ser Ala Gin Lys Thr lie 
1 5 10 15 

Val Leu Ala Thr Ala Gly Asp Val . Pro Pro Phe Asp Tyr. Glu Asp Lys 
20 25 -30 

Gly Asn Leu Thr Gly Phe Asp lie Glu Val Leu Lys Ala Val Asp Glu 
35 40 45 

Lys Leu Ser Asp Tyr Glu He Gin Phe Gin Arg Thr Ala Trp Glu Ser 
50 55 60 

He Phe Pro Gly Leu Asp Ser Gly His Tyr Gin Ala Ala Ala Asn Asn 
65 .70 75 80 

Leu Ser Tyr Thr Lys Glu Arg Ala Glu Lys Tyr Leu Tyr Ser Leu Pro 
85 90 .95 

He Ser Asn Asn Pro Leu Val Leu Val Ser Asn Lys Lys Asn Pro Leu 
100 105 110 

Thr Ser Leu Asp Gin He Ala Gly Lys Thr Thr Gin Glu Asp Thr Gly 
115 , 120 125 

Thr Ser Asn Ala~"Gln Phe He Asn Asn Trp Asn Gin Lys His Thr Asp 
130 ~ 135 140 



Asn Pro Ala Thr He Asn Phe Ser Gly Glu Asp He Gly Lys Arg He 
145 150 155 160 



Leu Asp Leu Ala Asn Gly Glu Phe Asp Phe Leu Val Phe Asp Lys Val 
165 170 175 



Ser Val Gin Lys lie Ile_ Lys Asp Arg Gly Leu Asp Leu Sef Val Val 
180 " 185 190 

Asp Leu Pro" Ser Ala Asp Ser Pro Ser Asn Tyr lie lie Phe Ser Ser 
195 200 205 

Asp Gin Lys Glu Phe Lys Glu Gin Phe Asp Lys Ala Leu Lys Glu Leu 
210 215 220 

Tyr Gin Asp Gly Thr Leu Glu Lys Leu Ser Asn Thr Tyr Leu Gly Gly 
225 230 235 240 

Ser Tyr Leu Pro Asp Gin Ser Gin Leu Gin 
245 250 

(2) INFORMATION FOR SEQ ID NO: 227: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 



GACTGGATCC TAAAATCTAC GACAATAAAA ATC 
(2) INFORMATION FOR SEQ ID NO : 22 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 
CTGAGTCGAC TGGTTGTGCT GGTTGAG 
(2) - INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double - 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 
GTCAGGATCC AAATTACAAT ACGGACTATG " 
( 2 ^INFORMATION FOR SEQ ID NO : 23 0: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CAGTGTCGAC TAACTCTAGG . TCGGAAAC 
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GACTGGATCC TGAGAATCAA GCTACACCCA AAGAG 
(2) INFORMATION FOR SEQ ID NO : 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGTCAAGCTT TTGTAACTGA GATT GATCTG G 
(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

GACTGGATCC TGGTAACCGC TCTTCTCGTA ACGCAGC 

(2) INFORMATION FOR SEQ ID NO : 234: 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear "V 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 



AGTCAAGCTT TTTCAGGAAC TTTTACGCTT CC 
(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
AGTCAGATCT TGTGGAAATT TGACAGGTAA CAGCAAAAAA GCTGC 
(2) INFORMATION FOR SEQ ID NO: 23 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 
ACTGAAGCTT TTTTGTTTTT CAAGAATTCA TCG 
(2) INFORMATION FOR SEQ ID NO: 23 7: ' ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 
GACTGGATCC TGGTCAAGGA ACTGCTTCTA AAGAC 
(2) INFORMATION FOR SEQ ID NO : 23 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid^ 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 23 8: 
AGTCAAGCTT T C AC AAATTC GTTGGTGAAG CC 
(2) INFORMATION FOR SEQ ID NO: 239: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 23 9 
GACTGGATCC TAGCTCAGGT GGAAACGCTG GTTCATCC 
(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 240 
AGTCAAGCTT ATCAACTTTT CCACCTTCAA CAACC 
(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

, (C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 
GTCAAGATCT CTCCAACTAT GGTAAATCTG CGGATGG 
(2) INFORMATION FOR SEQ ID NO : 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 
AGTCCTGCAG ATCCACATCC GCTTTCATCG GGTTAAAGAA GG 
(2) INFORMATION FOR SEQ ID NO : 2 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 
~(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



338 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 
GAC TGGATCC TGGGAAAAAT TCTAGCGAAA CTAGTGG 
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 
GTCACTGCAG CTGTCCTTCT TTTACTTCTT TGGTTGC 
(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 245 
GACTGGATCC TGCTAGCGGA AAAAAAGATA CAACTTCTGG 
(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi)- SEQUENCE DESCRIPTION: SEQ ID NO : 246 
CTGAAAGCTT TTTTGCCAAT CCTTCAGCAA TCTTGTC 
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 
GAC TAGATCT TGGCTCAAAA AATACAGCTT CAAGTCC 
(2) INFORMATION FOR SEQ ID NO : 248: c 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 
< D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 

AGTCCTGCAG GTTTTTGTTT GCTTGGTATT GGTCG 

(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH': 41 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 
GACTGGATCC TAGTACAAAC TCAAGCACTA GTCAGACAGA G 
(2) INFORMATION FOR SEQ ID NO: 250: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 
CAGTCTGCAG TTTCAAAGCT TTTTGTATGT CTTC 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 251 

GACTGGATCC TGGCAATTCT GGCGGAAGTA AAGATGC 

(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 
/ (A) LENGTH: 3 6 base pairs 
(B) TYPE: nucleic acid 

- (C1_STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 



AGTCAAGCTT GTTTCATAGC TTTTTTGATT GTTTCG 
(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 
GACTGGATCC TTCACAAGAA AAAACAAAAA ATGAAGATGG 
(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 254 
AGTCAAGCTT ATCGAC GTAG TCTCCGCCTT C 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 
GACTGGATCC GAAAGGTCTG TGGTCAAATA ATCTTACC 
(2) INFORMATION FOR SEQ ID NO : 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 
AGTCAAGCTT AGAGTT AACA TG G TGC TTGC CAATAGG 
(2) INFORMATION FOR SEQ ID NO: 257: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GACTGGATCC AAACTCAGAA AAGAAAGCAG ACAATGC 
(2) INFORMATION FOR SEQ ID NO : 2 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGTCAAGCTT CCAAACTGGT TGATCCAAAC CATCTG 
(2) INFORMATION FOR SEQ ID NO : 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GACTGGATCC TTCGAAAGGG TCAGAAGGTG CAGACC 
(2) INFORMATION FOR SEQ ID NO:. 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

AGTCAAGCTT CTGTAGGCTT GGTGTGCCCC AGTTGC 

(2) INFORMATION FOR SEQ ID NO:' 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 
: .— type : nucleic acid 

(C) STRANDEDNESS: double 

~~ " (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 



CTGAGGATCC GGGGATGGCA GCTTTTAAAA ATC 



33 



(2) INFORMATION FOR SEQ. ID NO : 2 62: 



"(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 



CAGTAAGCTT GTTTACCCAT TC AC C ATT AC C 



31 



Q 



(2) INFORMATION FOR SEQ ID NO: 263: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 



(2) INFORMATION FOR SEQ ID NO : 264: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 
TCAGAAGCTT GTTTAC'CCAT TCACCATT 28 
(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 
GACTGGATCC CTGTGGTGAG GAAGAAACTA AAAAG : _ . 35 



(2) INFORMATION FOR SEQ ID NO : 266: 



CAGTGGATCC AGACGAGCAA AAAATTAAG 



29 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) " STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 266: 
CTGAGTCGAC AATATTCTGT AGGAATGCTT CGAATTTG 3 8 

(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 267: 
CTGAGGATCC GACTTTTAAC AATAAAACTA TTGAAGAG 3 8 

(2) INFORMATION FOR SEQ ID NO : 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 
GTCACTGCAG GTTGTCACCT CCAAAAATCA CGG 33 
(2) INFORMATION FOR SEQ ID NO : 269: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 
GACTGGATCC CTTTACAGGT AAACAACTAC AAGTCGG - 37 
(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double " ^ 

(D) TOPOLOGY: linear 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 
CAGTAAGCTT TTCGAAGTTT GGCTCAGAAT TG . 32 



(2) INFORMATION FOR SEQ ID NO : 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 
GACTGGATCC CCAGGCTGAT ACAAGTATCG CA ,32 
(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
CAGTAAGCTT ATCTGCAGTA TGGC TAG ATG G 31 
(2) INFORMATION FOR SEQ ID NO : 273: 

(i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
GACTGGATCC GTCTGTATCA TTTGAAAACA AAGAAAC 37 
(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double ----- 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
CAGTCTGCAG TTTTACTGTT GCTGTGCTTG TG ~ 32 



(2) INFORMATION FOR SEQ ID_NO : 27 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 
ACTGAGATCT TGGTCAAAAG GAAAGTCAGA CAGGAAAGG 
(2) INFORMATION FOR SEQ ID NO : 27 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 
CAGTAAGCTT ATTCCTGAGC TTTTTTGATA AAGGTTGCGC A 
(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 
ACTGGGATCC GAAGGATAGA TATATTTTAG CATTTGAGAC 
(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 
AGTCAAGCTT CCATGGTATC AAAGGCAAGA " CTTGG 
(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 27 9 
GTCAGGATCC GGTAGTTAAA GTTGGTATTA ACGG 
(2) INFORMATION FOR SEQ ID NO : 28 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 280 
AGTCAAGCTT GCAATTTTTG CGAAGTATTC CAAGAG 
(2) INFORMATION FOR SEQ ID NO : 281: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 
AGTCGGATCC TTCTTACGAG TTGGGACTGT ATCAAGC 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 
AGTCAAGCTT GTTTATTTTT TCC TTACTTA CAGATGAAGG 
(2) INFORMATION FOR SEQ ID NO : 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 
AGTCGGATCC TACTGAGATG CATCATAATC TAGGAGC 
(2) INFORMATION FOR SEQ ID NO: 284: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE:' nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 
TCAGCTCGAG TTCTTTGACA TCTCCATCAT AAGTCGC 
(2) INFORMATION FOR SEQ ID NO : 28 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 5 
GACTGGATCC GGTTTTGAGA AAGTATTTGC AGGGG 
(2) INFORMATION FOR SEQ ID NO : 28 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: . linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 
. CAGTAAGCTT . GGATTTTTTC ATGGATGCAA TTTTTTTGG 
(2) INFORMATION FOR SEQ ID NO : 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 
GACTGGATCC GACAACATTT ACTATCCATA CAGTAGAGTC AGC 
(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288 
GAC TAAGCTT GGCATAAGGT TGCAATTCTG GATTAATTGG 
(2) INFORMATION FOR SEQ ID NO : 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 
GAC TGGATCC GGCTAAGGAA AGAGTGGATG 
(2) INFORMATION FOR SEQ ID NO : 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 
GACTAAGCTT TTCATTTTTA AATTGACTAT GCGCCCG 
(2) INFORMATION FOR SEQ ID NO : 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 291 
GACTGGATCC TTGTTCCTAT GAACTTGGTC GTCACC 
(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs. . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 292 
CATGAAGCTT ATCCTGGATT TTTCCAAGTA AATCT 
(2) INFORMATION FOR SEQ ID NO: 293: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 
GACTGGATCC TTATAAGGGT GAATTAGAAA AAGG 3 4 

(2) " INFORMATION FOR SEQ ID NO : 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 94: 

GACTAAGCTT CTTATTAGGA TTGTTAGTAG TTG 33 

(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 37 base pairs 
_ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
GACTGGATCC GAATGTTCAG GCTCAAGAAA GTTCAGG 37 
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 
GACTAAGCTT TTCCCCTGAT GGAGCAAAGT AATACC 3 6 

(2) INFORMATION FOR SEQ ID NO: 297: 

. (i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 4 0 base pairs ~ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS.: „„do_uble 

(D) TOPOLOGY: linear - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 97 
G AC TGGATC C . CTTGGGTGTA ACCCATATCC AGCTCCTTCC 
. .(2) INFORMATION- FOR SEQ ID NO: 298: 

(i), SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 98 
GACTGTCGAC TTCAGCTTGT TTATCTGGGG TTGC 
(2) INFORMATION FOR SEQ ID NO: 299: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 99 
GAC TGGATCC TAGTGATGGT ACTTGGCAAG GAAAACAG 
(2) INFORMATION FOR SEQ ID NO : 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 
ACTGCTGCAG ATCTTTGCCA CCTAGCTTCT CATTG 
(2) INFORMATION FOR SEQ ID NO: 3 01: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 01 
GTCAGGATCC TGGGATTCAA TATGTCAGAG ATGATACTA G - 
(2) INFORMATION FOR SEQ ID NO: 302: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 
CTAGAAGCTT ACGCACCCAT TCACCATTAT CATTG 
(2) INFORMATION FOR SEQ ID NO : 3 03: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 03 
GTCAGGATCC GGATAATAGA GAAGCATTAA AAACC 
(2) INFORMATION FOR SEQ ID NO: 3 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO-: 3 04 
AGTCAAGCTT GACAAAATCT TGAAACTCCT CTGGTC 
(2) INFORMATION FOR SEQ ID NO : 3 05: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
,(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 
GTCAGGATCC AGATTTTGTC GAGGAGTGTC ATACC 
(2) INFORMATION FOR SEQ ID NO: 3 06: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid ~ 

(C) ^STRANDEDNESS : double .. 

(D) -TOPOLOGY: linear . 
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(xi) SEQUENCE DESCRIPTION: ' SEQ ID NO: 306: 

AGTCAAGCTT TCCCTTTTTA CCCTTACGAA TCCAGG 3 6 

(2) INFORMATION FOR SEQ ID NO: 3 07: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 07: 
GACTGGATCC ATCTGTAGTT TATGCGGATG AAACACTTAT TAC 43 
(2) INFORMATION FOR SEQ ID NO: 308: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 
GACTGTCGAC GCTTTGGTAG AGATAGAAGT CATG 34 
(2) INFORMATION FOR SEQ ID NO: 3 09: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 09: 
GACTGGATCC TTACTTTGGT ATCGTAGATA CAGCCGGC 3 8 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double ~ — -■ 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
GACTGGATCC AGCTAAGGTT GCATGGGATG CGATTCG 37 
(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 
GAC TGTCGAC CTGGGCTTTA TTAGTTTGAC TAGC 34 
(2) INFORMATION FOR SEQ ID NO : 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
CAGTGGATCC CTATCACTAT GTAAATAAAG AGA 33 
(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

ACTGAAGCTT TTCTGTCCCT GTTTGAGGCA 30 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double _ 

(D) TOPOLOGY: linear ? 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 
CAGTGGATCC TGAGACTCCT CAATCAATAA CAAA 
(2)' INFORMATION FOR SEQ ID NO : 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 
ACGTAAGCTT ATAATCAGTA GGAGAAACTG AACT 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 
CAGTGGATCC GGATGCTCAA GAAACTGCGG 

(2) INFORMATION FOR SEQ ID NO: 318: — 

(i) SEQUENCE CHARACTERISTICS: 
.(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318 
GACTAAGCTT -TTGCCTCTCA TTCTTGCTTC C 
(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 
CAGTGGATCC CGACAAAGGT GAGACTGAG 
(2) INFORMATION FOR SEQ ID NO: 32 0: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320 
ACGTAAGCTT ATTTCTTAAT TCAAGTGTTT TCTCTG 
(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 
GAC TGGATCC AAATCAATTG GTAGCACAAG ATCC 
(2) INFORMATION FOR SEQ ID NO : 3 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 
CAGTGTCGAC ATTAGGAGCC ACTGGTCTC 
(2) INFORMATION FOR SEQ ID NO: 323: 

\(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 23 
CAGTGGATCC CAAACAGTCA GCTTCAGGAA C 
(2) INFORMATION FOR SEQ ID NO : 324: 

. . (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
GAC TCTGC AG TTTAATCTTG TCCCAGGTGG 3 0 

(2) INFORMATION FOR SEQ ID NO: 3 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 
GAC TGGATC C ATTCGATGAT GCGGATGAAA AG .32 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 
GACTAAGCTT CATTTGTCTT TGGGTATTTC GCA 33 
(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double. 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 
C AGTGGATC C GGAGAGTCGA TCAAAAGTAG 3 0 

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 
_GTCACTGCAG TTGCTCGTCT CGAGGTTC 
(2) INFORMATION FOR SEQ ID NO : 329: 



28 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329 
CAGTGGATCC ATGGACAACA GGAAACTGGG AC 
(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 33 0 
CAGTAAGCTT ATTAGCTTCT GTACCTGTGT TTG 
(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 331 
GACTGGATCC CGATGGGCTC AATCCAACCC CAGGTCAAGT C 
(2) INFORMATION FOR SEQ ID NO: 332: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 
GACTCTGCAG CATAGCTTTA TCCTCTGACA TCATCGTATC 
(2) INFORMATION FOR SEQ ID NO : 333: 

(i) . SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

* - (C) STRANDEDNESS : double 

CD) TOPOLOGY: linear 



358 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 3 
GACTGGATCC TTCCAATCAA AAACAGGCAG. ATGG 
(2) INFORMATION FOR SEQ ID NO : 3 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 334 
GACTAAGCTT GAGTCCCATA GTCCAAGGCA 
(2) INFORMATION FOR SEQ ID NO: 33 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 5 
AGTCGGATCC TATCACAGGA TCGAACGGTA AGACAACC 
(2) INFORMATION FOR SEQ ID NO : 33 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 33 6 
ACTGGTCGAC TTCTTTTAAC TCCGCTACTG TGTC 
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 337 

CAGTGGATCC AAGTTCATCG AAGATGGTTG GGAAGTCC 

(2) INFORMATION FOR SEQ ID NO: 338:. 




(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 32 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 
__■ (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 33 8 
GATCGTCGAC CCGCTCCCAC ATGCTCAACC TT 
(2) INFORMATION FOR SEQ ID NO : 33 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double. 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 
TGACGGATCC ATCGCTAGCT AGTGAAATGC AAGAAAG 
(2) INFORMATION FOR SEQ. ID NO : 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 
TGACAAGCTT ATTCGTTTTT GAACTAGTTG CTTTCGT 
(2) INFORMATION FOR SEQ ID NO: 341 : " 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic, acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 
GAC TGGATCC GCACCAGATG GGGCACAAGG TTCAGGG 
(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE : "nucleic acid 

(C) STRANDEDNESS: double 
"... (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 
TGACAAGCTT AACTTGTAAC GAACAGTTCA ATCTG 3 5 

(2) INFORMATION FOR SEQ iff NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
'(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 
GACTAGATCT TTTTAACC C A ACTGTTGGTA CTTTCC 36" 
(2) INFORMATION FOR SEQ ID NO: 344: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 344: 
TGACAAGCTT GTTAGGTGTT ACATTTTGAC CGTC 34 
(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: . ' 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid " ' . 

(C) STRANDEDNESS: double 

(D) .TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:. 
ACTGAGATCT TTTTAACCCA ACTGTTGGTA CTTTC 35 
(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double — 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 



GACTAAGCTT TCTACGATAA CGATCATTTT CTTTACC 37 
(^ INFORMATION FOR SECT ID NO:' 347: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 47: 
GACTGTCGAC TCGTAGATAT TTAAGTCTAA GTGAAGCG 38 
(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 
AGTCAAGCTT GTTAGGTGTT ACATTTTGCA AGTC 34 
(2) INFORMATION FOR SEQ ID NO : 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE:' nucleic acid 
'CO STRANDEDNESS: double 

CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
GACTGGATCC CTTTGGTTTT GAAGGAAGTA AG 32 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

TGACCTGCAG ACGATTTTTG AAAAATGGAG GTGTATC 37 

(2) INFORMATION FOR SEQ ID NO: 3 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double ■ ~ 

(D) TOPOLOGY: linear ' ~\ " 



362 



■ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 51 
CAGTGGATCC CTACTACCTC TCGAGAGAAA G 
(2) INFORMATION FOR SEQ ID NO : 3 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 
ACTGAAGCTT TTCGCTTTTT AC TC GTTTG A CA 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353 
CAGTGGATCC TAAGGTCAAA AGTCAGACCG CTAAGAAAGT GC 
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 
CAGTAAGCTT TAGGGTATCC AAATACTGGT TGTTGATG ' 
(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 
TGACAGATCT TGACGGGTCT CAGGATCAGA CTCAGG 
"(2) INFORMATION FOR SEQ ID NO : 356: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356 
TGACAAGCTT CAAAGACATC CACCTCTTGA CCTTTG 
(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS: double 

(D) • TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 3 57 
GACTGGATCC TAGAGGCTTT GCCAAATGGT GGGAAGGG 
(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 
GTCAGTCGAC TTGTTGTAAC ACTTTTCGAG GTTTGGTACC 
(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 

CAGTGGATCC T C AAAAAG AG AAGGAAAACT TGG 

(2) INFORMATION FOR SEQ ID NO : 360: 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 34 base pairs 
.(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



# 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 69: 
GACTGGATCC GTCCGGCTCT GTCCAGTCCA CTTTTTCAGC G 41 
(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAGAAGCTT ATTTTTTGTT TCCTTAATGC GTT ° -* 33 

(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : ' double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 
GACTGGATCC GGGACAAATT CAAAAAAATA GGCAAGAGG 3 9 

(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION :. SEQ ID NO: 372: 
GTCAAAGCTT TGGCTCTTTG ATTGCCAACA ACTG 34 
'(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
. (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION :. SEQ ID NO: 373 : 
GACTGGATCC TCGCTAGCAG -GAAGAAAGGG AGCAAAAGG 39 
(2) INFORMATION FOR SEQ ID NO: 374: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 65 
CTGAGGATCC AATTGTACAA TTAGAAAAAG ATAGC 
(2) INFORMATION FOR SEQ ID. NO : 3 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
"(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 66 
TGACAAGCTT GCGTTGACTA GGTTCTGCAA TGCC 
(2) INFORMATION FOR SEQ ID NO:* 3 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 367 
GACTGGATCC TCTGACCAAG CAAAAAGAAG CAGTCAATGA 
(2) INFORMATION FOR SEQ ID NO : 3 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 
TCAGCAGCTG ATCATTGACT TTACGATTTG CTCC 
(2) INFORMATION FOR SEQ ID NO: 3 69: 

(i).- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs , 

(B) TYPE: nucleic acid 

— — -(€-)- STRANDEDNESS: double . 

- '(D) TOPOLOGY: linear . 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 
GACTAAGCTT ACTTTTTTCT TTTTCCACAC GA 32 
(2)- INFORMATION FOR SEQ ID NO: 37 5: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 
CAGTGGATCC GAACCGACAA GTCGCCCACT ATCAAGACT 3 9 

(2) INFORMATION FOR SEQ ID NO : 37 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 
CTGAAAGCTT TGAATTCTCT TTCTTTTCAG GCT - 33 

(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

TCGAGGATCC GGTTGTCGGC TGGCAATATA TCCCGT 3 6 

(2) INFORMATION FOR SEQ ID NO: 37 8: 

(i) SEQUENCE CHARACTERISTICS: 
.(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 
--" (C) STRANDEDNESS : double— — — 

(D): -TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: - 
CAGTCTGCAG TTTCTTCAAC AAACCTTGTT CTTG 34 
(2) INFORMATION FOR SEQ ID NO: 3 61: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs. 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: . linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 
CAGTGGATCC ACGTTCTATT GAGGACCACT T 31 
(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 62: 
CAGTAAGCTT TTCCTTCTCA GTCAATTCTT TTCC 34 
(2) INFORMATION FOR SEQ ID NO: 3 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 
GACTGGATCC CGCTCAAAAT AC C AGAGGTG TTCAG 35 
(2) INFORMATION FOR SEQ ID NO: 3 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs ' ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
GACTAAGCTT AGTACCATGG GTGTGACAGG TTTGAA - 36 
(2) INFORMATION FOR SEQ ID NO: 3 65: 



• 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



CAGTAAGCTT CCGAACCCAT TC GC C ATT AT AGTTGAC 
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(2) INFORMATION FOR SEQ ID NO : 379: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 379: 
AGTCGGATCC GGCCAAATCA GAATGGGTAG AAGAC 35 
(2) INFORMATION FOR SEQ ID NO: 380: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 5 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 
GACTGGATCC ATTTGCAGAT GATTCTGAAG GATGG 35 
(2) INFORMATION FOR SEQ ID NO: 382: 

. (i) SEQUENCE CHARACTERISTICS:- 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

TCAGCTGCAG CTTAACCCAT TCACCATTCT AGTTTAAG 3 8. 

(2) INFORMATION FOR SEQ ID NO: 383: 




TGACCTGCAG CTTCTCATTG ATTTTC ATC A TCAC 



(2) INFORMATION FOR SEQ ID NO : 381: 



34 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double " _ 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 
GACTGGATCC TGTCGCTGCA AATGAAACTG AAGTAGC .37 
(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



"(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 
GACTAAGCTT ATACCAAACG CTGACATCTA CGCG 34 
(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
. . .. - (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 
AGTCAGATCT TACGTCTCAG CCTACTTTTG TAAGAGC 37 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 34~ base pairs - 

(B) TYPE: nucleic acid , 

(C) .STRANDEDNESS: double 

(D) TOPOLOGY: linear 



~ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 
GACTAAGCTT AACCCATTCA CCATTGGCAT TGAC 34 
(2) INFORMATION FOR SEQ ID. NO: 387: " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs .„ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double ' . - 

(D) TOPOLOGY: linear"" . --_ 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 
CAGTGGATCC TGGACAGGTG AAAGGTCATG CTACATTTGT G" 
(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 
GACTAAGCTT CAACCATTGA GACCTTGCAA CAC 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 89: 
GTCAGGATCC GATTGCTCCT TTGAAGGATT TGAGAGAAAC C 
(2) INFORMATION FOR SEQ ID. NO: 390: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 90: 
GACTAAGCTT CGATCAAAGA TAAGATAAAT ATATATAAAG T 
(2) INFORMATION FOR SEQ ID NO: 391: 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 
GAC TGGATCC TAGGTCATAT GGGACTTTTT TTCTACAACA AAATAGG 



(2) INFORMATION FOR SEQ ID~NO : 392: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double , 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 

TGACAAGCTT ATCTATCAGC TCATTTAATC GTTTTTG . 37 

(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 5 base pairs 
, (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
CTGAGGATCC CAACGTTGAG AATTATTTGC GAATG 3 5 

(2) INFORMATION FOR SEQ ID NO: 3 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear _. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 94: 
* 

TGACAAGCTT GAGTCTACAA AAGTAATGTA C 31 
(2) INFORMATION FOR SEQ ID NO: 3 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 
GTCAGGATCC CTACTATCAA TCAAGTTCTT CAGCC 35 
(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double — 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 
TGACAAGCTT GACTGAGGCT TGGACCAGAT TGAAAAG ~ 37 



(2) INFORMATION FOR SEQ ID NO : 3 97: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 6 base pairs 
. (B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 
GACTGGATCC GACAAAAACA TTAAAACGTC CTGAGG . 3 6 

(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 
GACTAAGCTT AGC AC GAACT GTGACGCTGG TTCC 34 
(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 
GACTGGATCC TTCTCAGGAG ACCTTTAAAA ATATC 35 
(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
-(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 
GACTAAGCTT GTTGGC CATC TTGTACATAC C 31 
(2) INFORMATION FOR SEQ ID NO: 401: 



373 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

- (C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 401: 
GACTGGATCC AGTAAATGCG CAATCAAATT C 31 
(2) INFORMATION FOR SEQ ID NO:. 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear. 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

;|f j AGTCCTGCAG GTATTTAGCC CAATAATCTA TAAAGCT 37 

SJ (2) INFORMATION FOR SEQ ID NO; 403: 

3i (i) SEQUENCE CHARACTERISTICS: 

p (A) LENGTH: 32 base pairs 

y]7 ( B > TYPE: nucleic acid 

(C) STRANDEDNESS: double 
LH (D) TOPOLOGY: linear . 

m - ■ ■ ■ " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 403: 



CAGTGGATCC TTACCGCGTT CATCAAGATG TC 32 
(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 



GACTAAGCTT GCCAGATGTT GAAAAGAGAG TG .32 

(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
'(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

- "(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 
GACTGGATCC GTGGATGGGC TTTAAC TATC TTCGTATTCG 
(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
■(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 406 
AGTCAAGCTT GCTAGTCTTC ACTTTCCCTT TCC 
(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 
GACTGTC GAC ACTAAACCAG CATCGTTCGC AGGA 
(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS-: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 
CTGACTGCAG CTTCTTGAAG AAATAATGAT TGTGG 
(2) INFORMATION FOR SEQ ID NO : 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409 
CAGTGGATGG— TGAG-TAGGTT GAAATC CCAC TT 
(2) INFORMATION FOR SEQ ID NO: 410: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE.: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 
CAGTAAGCTT TTTTTTAAGG TTGTAGAATG ATTTCAATC - 3 9 

(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 411: 
C AGTGTC G AC TCGTATCTTT TTTTGGAGCA ATGTT 35 
(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: • 
GACTAAGCTT AAATGTTCCG ATACGGGTGA TTG 33 
(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
. (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 
CAGTGGATCC GGACTCTCTC AAAGATGTGA AAG 33 
(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS-:— doubl-e = — 

— r- (D) TOPOLOGY: linear ■-- 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 
GACTAAGCTT" CTTGAGTTTG TCAAGGATTG CTTT 
(2) INFORMATION FOR- SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) . TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



"(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 
CAGTGGATCC CAAGAAATCC TATCATCTCT TCCAGAAG 
(2) INFORMATION FOR SEQ ID NO': .416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 
GACTAAGCTT TTCAGAACTA AAAGCCGCAG CTT 
(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
, (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 
GAC TGGATCC ACGAAATGCA GGGCAGACAG 
(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
"(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 
CAGTAAGCTT ATCAACATAA TCTAGTAAAT AAGCGT 



(2) INFORMATION FOR SEQ ID NO: 419: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419 
CAGTGGATCC TGTATAGTTT TTAGCGCTTG TTCTTC 
(2) INFORMATION FOR SEQ ID NO : 420: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 0 
GTCAAAGCTT TGATAGAGTG TCATAATCTT CTTTAG 
(2) INFORMATION FOR SEQ ID NO : 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421 
GACTGGATCC GTGTGTCGAG CATATTCTGA AG 
(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double. 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 
CAGTAAGCTT ACTTTTACCA TTTCTTTGTT CTGCATC 
(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base.. pairs 

(B) TYPE: nucleic. acid 

(C) STRANDEDNESS: double- 

(D) TOPOLOGY: linear ' — 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 
GACTGTCGAC GTGTTTGGAT AGCATTCAGA ATCAGACG 
42) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 
CAGTAAGCTT CGGAAGTAAA GACAATTTTT CC 
(2) INFORMATION FOR SEQ ID NO: 42 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 
CAGTGGATCC GTGCCTAGAT AGTATTATTA CTCAAAC 
- (2) INFORMATION FOR SEQ ID NO : 42 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 6: 
GACTAAGCTT TTTGCTTATT TCTCTCAATT TTTC 
(2) INFORMATION FOR S£Q ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 
CAGTGGATCC CATTCAGAAG CAGACCTATC AAAATC 
(2) INFORMATION FOR SEQ ID NO : 428: 
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(i) SEQUENCE CHARACTERISTICS :. 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
ACTGAAGCTT ATGTAATTTT TTAGATTTTT CAATATTTTT CAG 
(2) INFORMATION FOR SEQ ID NO: 429: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 429: 
AGTCGGATCC TAAGGCTGAT AATC GTGTTC AAATG 
(2) INFORMATION FOR SEQ ID NO: 43 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 
GACTAAGCTT AAAATTAGAT AG AC GTTGAG T 
(2) INFORMATION FOR "SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 
AGTCGGATCC C TGTGGC AAT CAGTCAGCTG CTTCC 
(2) INFORMATION FOR SEQ ID NO: 432: 

' - (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



# 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 
GACTGTCGAC TTTAATCTTG TCCCAGGTGG TTAATTTGCC 
(2) INFORMATION FOR SEQ ID NO: 433: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 
ACTGGTCGAC TTGTCAACAA CAACATGCTA CTTCTGAG 
(2) INFORMATION FOR SEQ ID NO : 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 434: 

GACTCTGCAG AAGTTTAACC CACTTATCAT TATCC 

(2) INFORMATION FOR SEQ ID NO: 43 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A), LENGTH: 3 9 base pairs 
.(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 435: 
ACTGGGATCC TTGTTCAGGC AAGTCCGTGA CTAGTGAAC 
(2) INFORMATION FOR SEQ ID NO: 43 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: '33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 43 6: 
GAC TAAGCTT GGCTAATTCC TTCAAAGTTT GCA 
(2) INFORMATION FOR SEQ ID NO : 437: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 7 

AGTCGGATCC CTCGCAAATT GAAAAGGCGG CAGTTAGCC 

(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 5 base pairs 
'(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 8 
GACTAAGCTT GTAAATAAGC GTACCTTTTT CTTCC 
(2) INFORMATION FOR SEQ ID NO: 43 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 

TCAGGGATCC TTGTCAGTCA GGTTCTAATG GTTCTCAG 

(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
" (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 
AGTCAAGCTT GGCATTGGCG TCGCCGTCCT TC 
(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 441 
GACTGGATCC GGAAACTTCA CAGGATTTTA AAGAGAAG 
(2) INFORMATION FOR SEQ ID NO : 442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 

GACTGTCGAC AATCAATCCT TCTTCT'GCAC TTCT 

O (2) INFORMATION FOR SEQ ID NO: 443: 

%j ' (i) SEQUENCE CHARACTERISTICS : 

^ (A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 
H\ (C) STRANDEDNESS : double 

11 y (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 
CAGTGGATCC TGTGGTCGAA GTTGAGACTC CTCAATC 
(2) INFORMATION FOR SEQ ID NO: 444: 

(i). SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 
GACTAAGCTT TTCTTCAAAT TTATTATCAG C 
(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445 
-AGTCGGATCC AACACCTGTA TATAAAGTTA CAGCAATCG 
(2) INFORMATION. FOR SEQ ID NO: 44 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 44 6 
GACTGTCGAC TACTTGACCG AATGCGTCGA ATGTACG 
(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447 
CTGAGGATCC ATTAGACAGA TTAATTGAAA TCGG 
(2) INFORMATION FOR SEQ ID NO : 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 
GACTGTCGAC TTTAAAGATT GAAGTTTTAA AGCT 
(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 
TGACGGATCC TAAGACAGAT GAAC GGAGC A AGGTG 
(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 
(C ) — STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 
CTGAAAGCTT TAAGGCTTCC TCAATGAGTT TGTCT 
(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 451: 
GACTGGATCC CTGTGAGAAT CAAGCTACAC CCA 
i 2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 
CTGAAAGCTT TTGTAACTGA GATTGATCTG GGAG 



